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PREFACE 


The title of this volume is an abbreviation for the more 
properly descriptive one. ‘'Topics in the theory of approxi¬ 
mation”. It is a brief essay in a field on which an encyclopedia 
might be written. On the personal side, it is an account 
of certain aspects and ramifications of a problem to which I 
was introduced at an early stage, and which has given direction 
to my reading and study ever since. 

One day about twenty years ago I was admitted to the 
study of Professor Landau, seeking advice as to a subject 
for a thesis. After some preliminary inquiries as to my 
experience and preferences, he handed me a long' sheet of 
paper, and directed me to take notes as he enumerated some 
dozen or fifteen topics in various fields of analysis and number 
theory, with a few words of explanation of each. He told 
me to think about them for a few days, and to select one 
of them, or any other problem of my own choosing, with the 
single reservation that I should not prove Fermat's theorem, 
an injunction which I have observed faithfully. Guided partly 
by natural inclination, perhaps, and partly by recollection 
of a course on methods of approximation which I had taken 
with Professor Bdcher a few years earlier, I committed myself 
to one of the topics which Landau had proposed, an investi¬ 
gation of the degree of approximation with which a given 
continuous function can be represented by a polynomial of 
given degree. When I reported my choice, he said meditative¬ 
ly, in words which I remember vividly in substance, if not 
perfectly as to idiom: “Das ist ein schOnes Thema, ich beneide 
Sie um das Thema . . . Nein, ich beneide Sie nicht, aber es 
ist ein wunderschdnes Thema T’ It is in fact a problem which 
admits a surprising variety of interesting developments on 



IV 


PREFACE 


its own account, and offers a natural avenue of approach 
to a number of fields of still broader importance. 

Although delayed in its completion by the conflict of other 
duties, the following exposition is substantially in the form 
in which it was projected at the time of the Colloquium 
lectures m 1925, and presented in abstract m the lectures 
themselves. One section, on the vector analysis of function 
space, originally designed for inclusion in the Colloquium, 
has meanwhile been published separately instead. The sections 
which had been written at full length in September, 1925 
— practically the whole of the first chapter, parts of the 
second, and most of the third — have been left unchanged, 
except in minor details. The elementary account of Legendre 
series in Chapter I, for example, was written before the appear¬ 
ance of the admirable article on the subject by M. H. Stone 
in vol. 27 of the Annals of Mathematics. A few other articles 
published since 1925 are mentioned in the text. 

For the most part, however, citations of the literature have 
been omitted. The preparation of a really adequate bibliog¬ 
raphy would have been a task of such magnitude as to 
delay the publication indefinitely. References to some of 
the most important papers of not too recent date are con¬ 
tained in my thesis (Gdttingen, 1911) and in my report 
on The general theory of approximation by polynomials and 
t) igonomctnc sums in vol. 27 of the Bulletin of the American 
Mathematical Society. Among publications in book form 
supplementing the material given here, mention should be 
made of Borel’s Lemons sur les fonctions de variables reelles 
et les developpements m series de polynomes, de la Vallee 
Poussin’s Legons sur Vapproximation des fonctions d’une 
variable n'elle , and S. Bernstein’s Legons sur les propnetes 
extremales et la meilleure approximation des fonctions analytiqnes 
d’une variable reelle, all appearing in the Borel series. As 
to the content of these lectures themselves, there are many 
points where it would be difficult now to recall the original 
sources either of specific results and proofs or of suggestions 
as to method. To the extent that the work is my own, some 
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parts have been published previously, in my thesis, in various 
articles in the Transactions of the American Mathematical 
Society, and elsewhere; other parts are now ottered in print 
for the first time. Numerous detailed acknowledgments, not 
repeated here, have been made in the pages of the earlier 
publications. In connection with Chapter IT, reference should 
still be made to the work of Faber on trigonometric inter¬ 
polation in his memoir Uber stetir/e Fnnhtionen (zueite Ah - 
hancUung) in vol. 69 of the Mathematische Annalen. My 
acquaintance with the statistical formulas discussed in Chapter V, 
which might have come from any of a variety of sources, was 
m fact mostly obtained from Yule’s Introduction to the The - 
o) y of Statistics. The lemma on which the method of Chapter III 
depends is derived from the most important single memoir in 
the literature on degree of approximation, S. Bernstein's epoch- 
making prize essay of 1912, with which the present ivork also 
has other points of contact. And in conclusion it should be 
said that my study of the problem has been dominated from 
the beginning not only by the influence of my own teachers. 
but also by the writings of Bebesgue and de la Vallee Poussin. 

October 1, 1929 

Dunham Jackson 
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CHAPTER I 


CONTINUOUS FUNCTIONS 

Introduction 

Weierstrass first enunciated the theorem that an arbitrary 
continuous function can be approximately represented by a 
polynomial with any assigned degree of accuracy. The 
theorem may be stated with precision in the following form- 

If f(x) is a given function, continuous for a x b, and 
if e is a given positive quantity, it is always possible to define 
a polynomial JPQzi) such that 

\f{x) — Pipe) I Cs 

for a <- x b. 

To Weierstrass is due also the corresponding theorem on 
approximation by means of trigonometric sums: 

If f (x) is a given function of period 2 tz , continuous for 
all real values of x, and if e is a given positive quantity, it 
is ahvays possible to define a trigonometric sum T (x) such that 

L/C*) — T(x)\<e 
for all real values of x. 

By a polynomial is meant an expression of the form 
ao -f- tfi X -f~ 0*2 + * • - — a 7 iX n . 

This expression will be said to represent a polynomial of 
the ntli degree, not only when a n is different from zero, but. 
in distinction from the usage which prevails in some parts 
of algebra, also when a n = 0. That is to say, the words 
“polynomial of the nth degree” will be used in place of the 
longer expression “polynomial of the nth degree at most”. 
Even the case of identical vanishing is not excluded. A trig - 
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onometnc mm, or more specifically a trigonometric sum of 
the nth order, is an expression of the form 

ao cos x-\-az cos 2.r+ • • - + a n cos nx 
+ &i sin x-{-!)■>, sin 2x-j- ••• + &« sin nx. 

The definition is inclusive once more; the simultaneous vanish¬ 
ing of an and b n is not ruled out. 

These two types of approximating function show a persistent 
and fundamental similarity in their behavior, on which 
differences of more or less significance are from time to time 
superimposed. Simplicity of statement and proof will favor 
sometimes one and sometimes the other. 

It is readily seen that the number of terms required to 
yield a specified degree of approximation, or, under the 
converse aspect, the degree of approximation attainable with 
a specified number of terms, will be related to the properties 
of continuity of fix). It is the purpose of the next pages 
to trace out this relationship-in some detail. 

i. Approximation by trigonometric sums 

3?or a considerable body of results, the following theorem 
may be regarded as fundamental: 

Theorem I. If f(x) is a function of period 2 n, such that 

l/fos) —/(*i) | < *| xs—xt | 

for all real values of Xi and x%, 1 being a constant, there will 
exist for every positive integral value of n a trigonometric 
sum T n (x), of the nth order, such that, for all real values of x, 

\f{x)-T n (x )\^ 

Yb 

where K is an absolute constant, depending* neither on x, nor 
on n, nor on 1, nor on any further specification with regard 
to the function f (sc). 

In the proof of the theorem, use will be made of the 
following 
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Lemma. If m is a positive' integer, the expression 

sin*(ma'/2) 

sin 4 (aj/2) 

is a tnffonometnc sum m x, of order 2m — 2. 

Because of the identity 

cos px cos qx = -jr- [cos (p + q)X -f- cos (p — q)x\ 

and the others of similar type, it is seen at once that the 
product of two trigonometric sums, of orders n x and n» 
respectively, is a trigonometric sum of order n x -f -n 2 . It is 
sufficient for the purpose in hand, therefore, to recall any 
one of the numerous proofs of the well-known fact that 

sin 3 (m a;/2) 1 — coswa- 

sin 3 (x/ 2) 1 — cos a* 

is a trigonometric sum of order m —1; its square will then 
be a sum of order 2m — 2. The fact that 1—cos mx is 
equal to the product of 1 — cos a? by a trigonometric sum of 
order m —1 appears, for example, from the identities 

W —-1 

1 — cosmx = ^ [cospa;—cos(p + l)a?], 

p—0 

cos px — cos(p -f- l)a; 

= (l — cos a;) — ± [cos(g—l)a;—2eos qx 

< 7=1 

+ cos(g-f l)j“], 

cos (q — l)x — 2 cos qx + cos (g +1) x, 

— [cos(g— l)aj + cost'g , + l)a;] —2cosg.r 
= 2 cosga; cos a; — 2 cosga; 

= —2cosgas(l — cosjc). 

To proceed with the proof of the theorem, let 
F m {u) = > I™(x) = Jim, J_ n/2 f(x-\-2u) Fm(u)du , 
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where m is any positive integer, and km, is defined by the 
equation 

1 fnl 2 

XT 


By means of the substitution x-\-2u = v, the expression 
for J«i («) is transformed into 

Fm [t dv ' 

Both factors in the last integrand have the period 2rt with 
regard to v, so that the value of the integral is unchanged 
if the interval of integration is replaced by any other interval 
of length 2 tt. In particular, 

JotO») === ~2~ hm §_ n f 00 Fm (v *)J dv . 

The expression F m [£ (v — a?)], by the Lemma above, is a 
trigonometric sum of order 2m—2 in (v — x), and may be 
regarded as a trigonometric sum of the same order in x, 
with coefficients which are trigonometric functions of v. The 
whole integrand is a trigonometric sum in x with coefficients 
which are continuous functions of v, and I m (x) therefore is 
a trigonometric sum of order 2 m — 2 in x, with constant co¬ 
efficients. The proof that this sum is an approximate rep¬ 
resentation of fix), when m is large, will be based on the 
original representation of lm(x). 

Let the equation defining h m be multiplied by bn fix'). 
Since /(re) is a constant as far as u is concerned, it may be 
placed under the sign of integration, so that 

fix) Fm (w) dw* 

Consequently 

Im{x)—f{x) = hm [/ (as+2 tt) —f (re)] F m (m) du. 

By the hypothesis imposed on fix), 

|/0r+2u)—/(*)] ^ 2l\u\. 
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Hence 


pw/2 


\Im{x)—f{x) I <; 2 1hm j__ nl2 \u I F m (m) dZl, 
or, since F m («) and | u | F m (u) are even functions of u, 


pvm 


\I m (x) —/(»)|<4A7i m J o uF m (u)du = 2A 


pw/2 

I uF m (u)du 


Jo 


J* ' F m (.u)dn 


To anticipate the conclusion of the proof, let 


Ci 


r 


:/s sin 4 1 

¥ 


at, 


<■‘2 


sin 4 * 


at. 


These quantities are merely numerical constants. Is is clear 
that each integrand approaches a limit for t — 0, and that 
the improper integral defining c% is convergent. 

By the use of the fact that 0 <! sin u < u for 0 < u < tt/2, 
and the substitution mu = t, it is recognized that 

rr^tT«*„= a r-ist* 

Jo Jo L mu -I m Jo t 

j_ r 2 A in< ^ 

— m Jo 


dt 


Cl 

m 


On the other hand, (sintt)/t< decreases monotonically as u 
goes from 0 to ?r/2, so that 

1 


s in u sin (?r/2) _ 2_ 

{ttI 2) n ’ 


u 


< 

sin u 2 u 


Hence 


throughout the interior of this interval, 


< 


From these relations it follows that 


Ci 

m* 




8 Ci m 
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Now let n be an arbitrary integer, and let m be taken 
equal to 1 or \in-\- 1), according as n is even or odd. 
In either case, 2m — 2 < n<2m. Let the corresponding 
expression (x) be denoted by T n (x). Then T n {x) is a 
trigonometric sum of the nth. order (it will be remembered 
that this is understood to mean of the nth order at most. 
according to the more usual terminology), and, since l/»i-C2/n. 


lm (<r) fif) 


rt^G, 2 A 
4 c a n 


~n ’ 


if K is taken equal to 7r 4 r 2 /(4ci). Thus the proof of the 
theorem is completed. 

So much has been conceded to simplicity of outline, in 
building up the above inequalities, that the final upper limits 
are quite unnecessarily large, giving little indication of the 
actual magnitude of the quantities that precede. It will 
add a little to the definiteness of the conclusion to point 
out that Cx > {2 Inf, since (sin t)/t > 2/n throughout the 
interior of the interval of integration, while 


C\ sin 4 1 , P“sin 4 tf 7J ^ 7J , r °°dt 

”J. — dt< Jo idt+ J, T» = *' 


C 2 

so that 


K = 


7T 4 6‘ s 

4 Ci 



100 . 


With more attention to detail, however, the estimate can be 
cut very much closer. The theorem is actually true with 
K — 3, instead of the value adopted above, or even with 
a somewhat smaller value of K. On the other hand, it can 
be shown that the statement is not generally true with a value 
of K smaller than n/2. 

To pass on to a more general theorem, let fix) be an 
arbitrary continuous function of period 2n, and let a>($) be 
the maximum of \fix%) —/(a*) | for \x s —a* | < d. The 
function to (d) has been called by de la Valine Poussin the 
modulus of continuity of fix'). With the word maximum 
replaced by least upper bound, it can be defined for any 
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bounded function, whether continuous or not. The character¬ 
istic property of a uniformly continuous function is that 
lim^o ®(<J) = 0. 

Let sp (x) be the continuous function of period 2 tc which 
takes on the same values as f (x) at the points 


■7T, 


■ 7t - 


2 TC 


n 


■ 71 ■ 


4?r 


n 


7T- 


2tt 


n 


it. 


and is linear from each point of this set to the next. The 
graph of f (x) is a broken line, no segment of which has 
a slope greater than a>i2icln)l (2tt/w) in absolute value. 
In analytical language, <jp (x) satisfies the hypothesis of 
Theorem I, with 

. _ a)(2yr/w) 

2 nln 


Eor every positive integral value of n, therefore, there is 
a trigonometric sum T n (x), of the wth order, such that 

I rte—r.W 

On the other hand, any specified value of x differs by less 
than 2 nIn from one of those for which / and y are by de¬ 
finition equal to each other; neither f{pc) nor y> (x) can differ 
by more than ®(2 n/n) from the corresponding common value; 
and hence 

\f{x) — y{x)\ < 2co 

for all values of x. If the quantity .K 7 ( 27 r) + 2 jg denoted 
by K', the last two inequalities may be combined to yield 
the following statement: 

Theorem II. If fix) is a continuous function of period 
2n, with, modulus of continuity «(d), there exists for every 
positive integral value of n a trigonometric sum T n ix), of 
the nth order, such that, for all real values of x, 

\fix)-Tnix)\ < K’u (--), 
where K' is an absolute constant. 
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While this theorem is applicable to any continuous function, 
it involves the modulus of continuity in the inequality which 
forms the essence of its conclusion. It can he shown that 
the assi g nment of an outer limit of error for an arbitrary 
continuous function, without some dependence on properties 
of the function beyond the mere fact of its continuity, is 
impossible. 

Since lim n =oo oo (2 nl ri) — 0, it is to be noted that 
Theorem 13 includes one of the theorems of Weierstrass to 
which reference was made in the opening- lines of the chapter. 

In preparation for *the next developments, there is occasion 
to examine more closely the proof that was given above for 
Theorem I. It will be recalled that to an arbitrary positive 
integer n a second positive integer m was assigned, in terms 
of which a function F m (u) was constructed; and a trigono¬ 
metric sum T n (x), yielding an approximate representation 
of the given function f(x), was defined as equal to an ex¬ 
pression which could be reduced to the form 

\ 1lm \ Fm [ 4 "^—*)] dv ’ 

hm being independent of x. A lemma stated essentially that 
F m (4 m) is a trigonometric sum in u, of order 2 m — 2 ^ n. 
It is possible therefore to write F m [4 (w—a;)] in the form 

1 n 

AnO + ' y] [An* COS k (v — x) -f BnJc sin k (v — *)]. 

A *=1 

When the above expression for T n (x) is expanded as a trig¬ 
onometric sum in x, the constant term is 

^hmAna JT/(t0 dv, 

and is zero if ike last integral vanishes, an observation which 
will presently be important, for the reason that the indefinite 
integral of a trigonometric sum without constant term is 
itself a trigonometric sum, while this is not the case if the 
sum to be integrated has a constant term different from zero. 
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It may be pointed out in this connection, though it is not 
essential to the main argument, that the coefficients JBnk are 
all zero. This can he inferred from an elementary theorem 
on trigonometric stuns, since F m (\u) is an even function of «, 
and is also directly apparent on inspection of the proof of 
the lemma. If aje, bk are the Fourier coefficients of fix): 

Ok — — \ fiv) cos Jcvdv, bu — — fiv) sin kv dv, 

TC d—71 Tt d—71 

and if \nh m A n k is denoted by dnk, it is seen that 
1 n 

T n (x) — ~n~dno op 4- S dnk(ak cos kx 4- bk sin kx ). 

* &=i 

As the d’s are independent of the function to be represented, 
the calculation of the successive expressions T n (x) amounts 
to a method of summation of the Fourier series for fix). 

To return from the digression of the last paragraph, let 
fix) be a function of period 2n, which has everywhere 
a continuous derivative fix). For a particular value of n, 
let 4 (a?) be a trigonometric sum of the nth order, without 
constant term: 

n 

4 (a?) —2 («fc cos kx + Ac sin for), 

k—1 

and let e n be a constant such that 

\f(x) — 46») | ^ e n 

for all values of x. Let tn{x) be the trigonometric sum, 
without constant term, which has 4(a0 for its derivative: 

tn ix) = ^ sin kx -cos kasj , 

and let r n (x) = fix) — t n (x). Then r n (x) has the period 2 n, 
and, since |r»(a?)| < s n , satisfies the conditions imposed on fix) 
in the hypothesis of Theorem I, with X = s n . Hence there 
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exists a trigonometric sum of the wth order, which may he 
denoted by r n (x), such that 


r n (x) — T n (x) \ ^ 


n 


If Tn (x) = tn (x) + T n (x) , then / (x) — Tn (x) = r n (x) — T n Or) > 
and 

i e n 


\f(x)-T n (x)\ < 


n 


From the existence of an approximation for f (x) it has been 
possible to draw an important inference with regard to the 
approximation of f(x) . If fix) is itself the derivative of 
a function of period 2rc, so that the integral of f (x) over 
an interval of length 2tc is zero, it follows that 

J^ n r n(x) dx = §_ n f(x) dr — t n (x) d x — 0, 

whence, according to the second paragraph preceding, the 
sum r n (x) given by the proof of Theorem I as an approximation 
for r n (x) will have no constant term. So the constant term 
in the present T n (x), defined in terms of this ■*»(*), will be 
zero likewise. 

The way has now been prepared for a demonstration of 
Theorem HE. If fix) is a function of period 2 tv, having 
a pth derivative fW (x) such that 

\f {p) (x*)—f l ' p) (xi) | <; X\x s — x t \ 

for all real values of och and x 2 , X being a constant, there will 
exist for every positive integral value of n a trigonometric sum 
Tn(x), of the nth order, such that, for all real values of x. 


\f(x)—T n (x)\ 


jsy+u 

n v+l 


where K is the absolute constant found in the proof of 
Theorem I. 

It is to be noticed that the argument is based on the ex¬ 
plicit construction of the approximating sum in Theorem I, 
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and it is not clear that a smaller value of K in that theorem, 
justified by some different method, would necessarily he 
applicable here. The value K = 3, previously mentioned 
in connection with Theorem I, results from the same explicit 
construction, and is available in Theorem m. 

By Theorem I itself, there exists a sum T»i (x) such that 

\fW{x) — Tm(oc)\ £ 

lb 

On the basis of the more recent discussion, as 

§_ n f (p) 6*0 dx = (»)— f(P~v (— n) — 0, 

it may be understood that T n i (x) has no constant term. 
Since (x) is the derivative of the periodic function 
ftp-u(x), furthermore, an approximating sum T n s (x) may 
be constructed for f (p ~ v (x), as indicated above, with 
e„ — KMn, and 

(a?)— T,i& (as)| < 


If jfj > 1, ftp-v (x) is itself the derivative of the periodic 
function f (p ~~ 2) (x), and the constant term in T„o (x) is zero. 
By a sufficient number of repetitions of the process, the 
theorem is established. 

In Theorem II, even if the integral of / ( x ) over an inter¬ 
val of length 2 pc is zero, the same thing is not necessarily 
true of the auxiliary function y(x), and it is not clear that 
the approximating sum in the conclusion of the theorem will 
lack the constant term. The difficulty is not a serious one, 
however. If 



let <jpi (x) = $p (x )— c/(2tt). Then 



0 , 


and there is a trigonometric sum T n i (x ), without constant 
term, such that 
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!.«-»)I £-^r » ( 4 T) ■ 


But on the hypothesis that the integral of fdx over a period 
is zero, 

|c| = = |J£,&>(»—/(*)]«* 


< Sir.8.(lA), 


and 

Iso (as)— SPx (*)1 ^ 2ft) , |/(a;)— SPx (a;)| < 4® . 


So the conclusion of Theorem IT applies to the approximate 
representation of fix) by a trigonometric sum without constant 
term, when fix) is the derivative of a periodic function, on 
the condition merely that K' — Ki (2 ?r) -|- 2 be replaced by 
JT'=.Ky(27r)+4. 

The same process of induction which was used to prove 
Theorem HI then serves to establish 
Theorem IV. If fix) is a function of period 2n winch 
has everywhere a continuous pth derivative, with modulus of 
continuity ®(d), there exists for every positive integral value 
of n a trigonometric sum T n (x), of the nth order, such that, 
for all real values of x, 

I/«-r»fe)| < K f* 


where K is the absolute constant given by the proof of Theorem I, 
and K" — K/i2n)-\-^ k 

A part of the content of this theorem may be restated 
in the following 

Corollary. If fix) is a function of period 2n which has 
everywhere a continuous pth derivative, there exists for every 
positive integral value of n a trigonometric sum Tnix), of the 
nth order, such that 

lim n* e n — 0, 

n=s oo 

if e n is the maximum of | fix) — T n ix)\. 
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a. Approximation by polynomials 

The development of the theory of approximation by trigono¬ 
metric sums -will be interrupted at this stage, to make way 
for a presentation of the corresponding results with regard 
to polynomial approximation. The transition will be aided, 
however, by one more lemma on the trigonometric side. 

Lemma. If fix ) is an even function of period 2 tv, and 
if there is a trigonometric sum T n (x), of the nth order, such 
that | f{x) — T n (x ) | <j e for all values of x, there exists a cosine 
sum C n (x) of the same order (that is, a trigonometric sum 
without sine terms), such that, for all values of x, 

| fix) — Cn (x) I e. 

When f(x ) is even, the approximating sum given by the 
proof of Theorem I will automatically lack the sine terms, 
as an immediate consequence of the fact, already pointed 
out that its definition is equivalent to a method of summation 
of the Fourier series; and this observation would be sufficient 
for the main argument; but it is of interest to note that the 
lemma subsists independently of any particular mode of con¬ 
struction of the original approximating function. 

For the proof, let 

Cnix) = -[Tn(x)+Tni—x)]. 

Then C n (x) consists merely of the cosine terms of T, »(«), 
without the sine terms. On the other hand, since fix) is 
even, 

fix) = y [/(*)+/(— a )l > 

and therefore 

\ fix) —Cnix) j = y [/(«) — Tnix)] + y [/(—-*0 — Tni—X)] 
^ e. 

Now let fix) be a function defined for — 1 < x <[ 1, and 
subject to the condition 

\fixz) —fix i)i < h\x a — X L \ 



14 


THE THEORY OF APPROXIMATION 


throughout this interval. Let 

r = cos 9, /O) = /(cos e) — sp(0). 

Then <p(6) is an even function, defined for all real values 
of 6, and 

1 9 ° (0s) — 9 °( 0 i)| = |/(cos e a ) — /(eos0i)| 

< X | cos 9 S — cos $ 1 1 <1 X\9s — 9i I. 

By Theorem I, together with the lemma just proved, there 
exists a cosine sum C n (6), of the nth order, such that 

But a cosine sum of the nth order in 9 is a polynomial of 
the nth degree in cos 9, which may be denoted by P n (x), 
and the conclusion is that a polynomial P n (x) exists such 
that 

l/W-AWI < ~~ 

for —1 < a? < 1. 

If the interval (—1, 1) in the hypothesis is replaced by 
an arbitrary interval (a, b), a preliminary transformation of 
variable may he made according to the formulas 

y =——i /(»)=/ iy), 

whereby / (y) is defined for — 1 < y < 1, and 

!/x <2/a) —ft O/i) I 

= \fM~ /fo)| < X\x s — Xt\ = — a) \y% — y t \. 

The result just obtained may then be applied to the approx¬ 
imation of ft (y) by a polynomial in y, which is at the same 
time a polynomial in a;. The general conclusion may be 
formulated as 

Theorem V. If f(x) satisfies the condition 

l/(^s) — f(xi )I < X\x t — Xy\ 
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throughout the closed interval ( a, h), of length l, there exists 
for every positive integral value ofna polynomial P n (x), of 
the nth degree, such that 


\f(f) — Pn(x)\ 


Lll 

n 


for a < x < b, with L — \K, where K is the constant of 
Theorem I. 

If the smallest possible value of K were found in Theorem I, 
it is not clear that JT would then be the smallest admissible 
value of L, hut it can be shown that the validity of Theorem V 
is not general for any L smaller than 

More generally still, suppose that f(x) is an arbitrary con¬ 
tinuous function for a<x^.b, and let 00 (d) be its modulus 
of continuity in this interval. With b — a —l, let g>(x) be 
the continuous function which takes on the same values as 
f(x) at the points 


, 2 ,2 1 

a, fl-i-, <H-, 

’ n ii 


■, b——, b, 
n 


and is linear from each point of this set to the next. The 
function g> (a?), having a broken line for its graph, satisfies 
the hypothesis of Theorem V, with 


while 


, _ a (l/n) 

l/n ’ 

j f{d) — y (x) | 2 oo (l/n) 


throughout (a, 5). There is a polynomial P n (x) such that 

\<p(x)—P*(x)\ ^ Loo (l/n)-, 

setting L-\-2 = L r , one may state 
Theorem VI. If f(x) is a continuous function with mod¬ 
ulus of continuity <a(d) in the closed interval (a, b), of length 
l, there exists for every positive integral value of n a poly¬ 
nomial P n (x), of the nth degree, such that, for a <j as < l, 

\f(x)-Pn(x)\ < L' CO (l/n), 

where U is an absolute constant. 
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This incidentally includes the theorem of Weierstrass on 
polynomial approximation, which was quoted at the beginning 
of the chapter. 

The proof of a theorem corresponding to Theorem in is 
simplified by the fact that the indefinite integral of a poly¬ 
nomial is always a polynomial, so that special considerations 
analogous to those relating to the constant term in the trig¬ 
onometric case are not needed. A new complication is 
introduced, on the other hand, by the circumstance that the 
degree of a polynomial is raised by integration, while the 
order of a trigonometric sum remains unchanged. 

Suppose that/(as) has a continuous derivative /'(a?) for 
a < x ^ b, and that there is a polynomial ph (x ), of degree 
n —1, such that 

|/' (x)—p' n (as)| <; e n 
throughout the interval. Let 


J^ph (x) dx — Pn (pc) , / (x) —Pn (x) = r n (x ). 

Since | r' n (x) | e n , r n (x) satisfies the hypothesis of Theorem V, 
with X = e n . There is consequently a polynomial Jt n (as), 
of the nth degree, such that 

L 


\r n (x) — n n (x)\ < 


n 


If Pn. (x) + n n (x) — P n (x), this P n (x) is a polynomial of the 
nth degree, and 


[/(as) —P n (as) | = | r n (x) — rt n (x) \ S ~~~ • 

rt 


Let /(as) have a pth derivative / (i,) (as), satisfying the 
condition that 

l/<*<«*)-/<*fo)I < Mxi-xxl 


throughout (a, b). If n— p>0, there is a polynomial of 
degree n—p which differs from /(as) by not more than 
LlX/(n — p) throughout the interval. There is then, by the 
preceding paragraph, a polynomial of degree »—1, 
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differing by not more than £ 2 J s A/[(m— p)(n —p + 1)] from 
ftp-v(x), and so on. Finally a polynomial P n (x) is obtained, 
of the «th degree, for which 


f(x)—P n {x)\ < 


J>+1 P+ 1 X _ 

( n — p){n —1> + 1) • • • n 


For the applications, the existence of a constant in the 
light-hand member of the last relation is far more important 
than any close estimate of its numerical value. It is most 
convenient, even at considerable unnecessary expense nume¬ 
rically, to be satisfied with the observation that 

_1_ n n 

( n — p) (n —p -(-1) • • • n n —p n — p -f-1 

n 1 ^ _1 

n —1 ’ n? +1 — p\ ' 

for n p +1, and to state the result in the form of 

Theobem VII. If fix) has a pth derivative f^ix) satisfying 
the condition that 

|/^(a?a) ~f^{x t ) j <; h\x i —x 1 \ 

throughout the closed interval (a, l>), of length l, there exists 
for every integral value of n >p a polynomial P n (a?), of the 
nth degiee, such that far a < x ^ h, 

where L p — ip +1) ? 2> +1 / p !, and L is the constant of 
Theorem V. 

It is clear that even with 1 = 0 the hypothesis implies 
nothing whatever as to the possibility of approximating fix) 
by a polynomial of degree lower than p, since fix) itself 
may then be any polynomial of the j>th degree. By suitable 
changes in formulation it would be possible, though of 
secondary interest, to admit the value n — 0 in Theorem V 
(or Theorem I), and the value n — p here. 

From Theorem VI, by reasoning similar to the above, one 
may deduce 


i 



18 


THE THEORY OF APPROXIMATION 


Theorem VIII- If fix) has a continuous ptk derivative unfit 
modulus of continuity co(d) throughout the closed interval ( a, h), 
of length l, there exists for every integral value of n>p 
a polynomial Pn (x), of the nth degree, such that for a^x<b. 

!/(*)—A«|£ 

■where L r v — (p-\- l )*’ -1 L* (L-\-2)/pl, and L is the constant 
of Theorem V. , 

Corollary. If f(x) has a continuous pth derivative for 
a<x<>b, there exists for every positive integral value of n 
a polynomial P n (x), of the nth degree, such that 

lim riPe n — 0 , 
n= oo 

where e n is the maximum of \f(x) —P»( x)\ in the interval 

{a, b). 

The exceptional status of the values of n < p has no 
significance for the corollary, which is concerned only with 
a limit for n — oo. 

3. Degree of convergence of Fourier series 

The preceding theorems can he made to serve as basis 
for a discussion of the convergence and rapidity of convergence 
of Fourier and Legendre series, as well as of other processes 
of approximation. This idea will be developed more fully in 
succeeding chapters; its first consequences will be presented here. 

With regard to the Fourier series for a given function f(x), 
it will be premised merely that it is a series of the form 

n 00 

- 5 - 2 (a* cos kx + Ik sin lex), 

* k=l 

in which the coefficients have the values 

«fc = J^^fit) aosktdt, he — — am kt dt, 

the formula for a» being applicable when h — 0 , as well 
as when k is positive. The expression 
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S n (x) = -77- + 2 Ofc cos kx + 6/c sin kx) 

& k=l 

will be called the “partial sum of the series to terms of the 
nth order.” 

If the quantity 

+ cost* + cos 2« + + cosmt 


is multiplied by 2sin£n, the product may be rewritten as 



which immediately reduces to sin (w + i) u, so that 


1 - . „ , , sin(n4-4)« 

2 2smitt 


Let each coefficient in S n ( x ) be replaced by the integral 
expression which defines it. Since cos kx and sin kx are 
independent of t, they may be written under the sign of 
integration, and the various integrals may be combined into 
a single one: 


8 n (x) = ~ (cos Jet cos kx + sin k t sin free) j d t 


1_ C n sin(n + fr) jt — x) 
nr J—2 sin£(2— x) 


dt. 


From the last expression may be deduced the following: 
Lemma. If fix), of period 2 n , is bounded and integvable 
(in the sense of Riemann or in the sense of Lebesgue), if 


1 / 0)1 < M 

for all values of x, and if Sn (x) is the partial sum of the 
Fouriei' series for f (as), to terms of the nth.order, then 


2* 
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l&fcD! ^ CMlogn, 


fw all values of x and far all values of n > 1, where C is 
an absolute constant, depending neither on x, nor on n, nor 
on the function fix). 

By the -hypothesis on fix), 




sin (n -|- jt — x) 
2 sin 4 (t — x) 


dt. 


If one-half the integral on the right is denoted by j n , its 
form may be modified by the substitution u — ^(2— x), and 
by recognition of the fact that the resulting integrand is an 
even function of u, of period n, to yield the conclusion that 


_ 1 r*vc—x)i2 

^ 2 J —(7r+as)/2 

_f»w /2 | sin (2 n + 1 )u 

sin w 


2 J —7 


nl2 I 


sin (2 re +1 )u 
sin u 

■/a | si n (2 n +1) u 
sin« 


du 

nu = r' 2 !^ 

Jo I SI 


\Sn(x)\ 


IT 


Prom the fact that 

8in(2«+l)» _ i + 2 

orn m 1 


it follows that 


sinw 

sin(2n j- l)n 
sintt 


*=i 


2n -f-1 


du, 


for all values of u. On the other hand, | sin (2w +l)t< I 
while 

sintt ^ sin (w/2) 2_ 1 

u — 7 t/2 n ’ sin m 

throughout the interval of integration. So 




— 2 n 


= 2 +i+f l 0 sf+f l0 *™- 


•/a dii 

tin U 
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The last expression does not exceed a constant multiple of 
log n, for n 2, and the conclusion of the lemma is justified. 

The method of application of the lemma, which is due to 
Lebesgue (though he did not make so extensive use of it as 
is done here), may be summarized in 
Theobem IX. Iff (as) is a continuous function of period 2j t, 
and 8 n (x) the partial sum of its JPownei series to terms of 
the nth order, n~> 1, and if there exists a trigonometric sum 
T n (x), of the nth order, such that 

\f(x )— T n {x) | < e„ 

for all values of x, then, for all values of x, 

\f{x) — £»(sc)| <; Be n \o%n, 

where B is an absolute constant. 

The statement is equally true, though of less interest, if 
/(as) is merely assumed to be integrable; of course e„ can 
not approach zero, when the relations are considered for 
successive values of n, unless f(x) is continuous. 

Let the sum T n (x) in the hypothesis have the expression 

n 

+ 2 (“ft cos kx + fa sin hoc ). 

2 k =l 

It is found by direct integration that 
— f 71 T»(<)cosA:< dt — ce k , — f” T n (t) sin&tf dt — fa, 

7t J—n n J—n 

for kfLn, so that the partial sum of the Fourier series for 
T n (x), to terms of order n, is identical with T n (x) itself. IE 
the Fourier coefficients of fix) are a*, b k , those of the function 

Bn(x) — fix )— T n (x ) 

are a*— a k , bk — fa, for Jc n, and the partial sum s n (x) 
of the Fourier series for Unix), to terms of order n, is 

s n (x) =-4-(«o—«o)+2 [(a* — “fc) cos kx + (bk — fa) sin kx] 

6 k=l 

= 8n(x)—T n (x). 
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Since \R n ix)\ <; e n , it follows from the Lemma that 
| s n 0*0 | <; Ce n log n. 

Consequently 

\f(x)-S n (x) I ^ \f(x)-T n (x) I + I Tn(x)- 8 n (x) i 

= \Rn(x)\ + \s n (x)\< 6 n- i rCe n \o%n<.e n + Cs n \ogn 

for n 2 , and the last expression has the form Bs n log n, 
with B — (log 2) -1 + C. 

The theorem may immediately be specialized and made 
more definite by combination with Theorems I—IY, as follows: 
Corollary I. If 

I/O**) —/0*OI <1 X\sc*—Xi | 
for all values of Xy and x a , X being a constant, then 

i/w—a.(«)i £ . 

Corollary II. If fix) is continuous with modulus of con¬ 
tinuity oo ( 3 ), 

1/0*0 —/SW (x») I <i Aa> log n. 

Corollary Ha, The Fourier series converges uniformly to 
the value fix), if fix) has a modulus of continuity at (d) 
such that lim^-_ 0 <B(d) log 8 = 0 (Lipschitz-Dini condition). 

Corollary IQ. If f (x) has a pth derivative fW (x) such 
that 

\f W («a) ~f (p) (*i) | ^ h\Xi—Xi\ 
for all values of Xt and Xs, X being a constant, then 

I/O*)-a, fa) i < . 

Corollary IY. If fisc) has everywhere a continuous pth 
derivative with modulus of continuity at id), 

1/0*0 8 n 0*01 < ^-ft»|-^-)logn. 
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In each of these statements, the conclusion holds for all 
values of x, and for all values of n^ 2; the coefficient A is 
an absolute constant, while A p depends only on p. 

The corollaries are stated separately for emphasis; it is 
clear that all are included in Corollary IV, if it is under¬ 
stood that p may in particular have the value 0. The use 
of the same A in I and II, and of the same A p in HI and 
IV, signifies merely that when two constants are concerned, 
one symbol may he used to represent the larger of them. 

It is worthy of note that even the dependence of A p on 
p can be eliminated, with a resulting simplification in Cor¬ 
ollaries HI and IV which does not have a counterpart, as 
far as the present evidence goes, in the case of the corre¬ 
sponding third and fourth Theorems. 

If fix) is a function of period 2?r having a continuous 
derivative, the Fourier series for fix) is that obtained by 
formal differentiation of the Fourier series for /( x). This 
is recognized without any further assumption as to the con¬ 
vergence of the series, from the relations 


cos 7c t dt — k J^/(t)sin&2 dt, 

J^ 7 l f > it) sin Ict dt = —k J^^f(t) cos ktdt. 


which axe obtained by integration by parts, the terms which 
would appear outside the integral sign reducing to zero, 
because of the periodicity of the functions involved. If the 
coefficients in the series for fix) once more are a*,, bk, and 
if fix) has a continuous derivative of order 2 q, where q is 
any positive integer, the Fourier coefficients for f 2 ^ (a?) are 
a k = (— aic, fik = (—1)® fc 2 ® bk, which means that the 
series for fix) can be written in the form 


— 1)® 2 Tsy ( a * cos + fa sin kx)- 

2 AT* 


The partial sums of the series for fix) and for jf®® > (a?), to 
terms of the wth order, may be represented by S n ix) and 
Sn 9> ix) respectively. 
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Let 

Or) = *.(«), 

and suppose now that |e»(as)| < *», where «»+i<£» for all 
values of n that are considered, and lim«=oo «» = 0. Then 

ajc cos lx + file sin 7{x — qjc —i (cc) — Qk (a?) . 

By any of the preceding corollaries, IIa, for example, the 
series for fix) converges to the value of the function, while 
the series ^Qk-i {x)f^, (x)/k^ are convergent, beeause 

| qjc (x) | is uniformly bounded. Hence the remainder f(x) — 8„ (x) 
can be written and rearranged as follows: 

fix) — B n (a?) — (—1)« 2 COS kx + fik sin kx) 

= (— i)* 2 w 0*) ~ $ k (*)I 

fc=n+l 

= (-!)««[- 4 ) ^ ' 

(w~ (Jr + J)») et(!c) ]• 

As the parentheses in the last summation are all positive, 
and as |^fc(a;)| ^ e*, <1 e n for 1c > n, 

\Ax) - S n (x) | ^ ( w + ij* + '» k f£ +1 (pF (k + lf*) 

_ 2 2 fi,i 

~ (w + l) 2 * ^ « 2 *' 

Under the hypotheses of Corollary HE, letjp = 2g if p is even, 
p ~ 2g + l if p is odd. Then Corollary I or Corollary HI 
may be applied directly to / (2?) (r), which is found to satisfy the 
requirements of the preceding paragraph, with e n — (AXlogn)/n 
in one case and e n — {A x X log n)/n s in the other. The 
corresponding values of 2 sjn 2 « are (2 AX log wj/n» +1 and 
(2 X log n)/nP +1 respectively. Bor literal accuracy, it must 
be admitted that (lo gn)/n diminishes only from n — 3 on, 
and that the conclusion has been established, when p is even, 
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only for to ^ 3. The value to = 2 may be included if the 
previous e a is replaced by a somewhat larger quantity, with 
a suitable adjustment of the constant in the conclusion, if 
necessary. 

Similar reasoning may be employed under the conditions 
of Corollary IV, except that it is no longer satisfactory to 
take 2q = p when p is even, as w (2 to/to) log to might not 
decrease with increasing to. It is sufficient, however, to let 
p = 2g+2 when p is even, p — 2g+l when p is odd, and 
to obtain s n by the application of Corollary IV to (x) 
as thus defined. From the definition of «(<5) it is certain 
that m (2 n /to) itself diminishes, or at any rate does not 
increase, as w increases. 

All the cases in question are covered, with some redundancy, 
if the following are considered successively: for «(<J) < Id, 
p = 0, 1,2 q (q ^ 1), 2g+l (q ^ 1); for general ®(d), 
p = 0, 1, 2, 2g+l 2q-\-2 (q^> 1). If a single letter 

is used to represent the largest of the finite number of con¬ 
stants entering into the corresponding conclusions, the result 
may be formulated thus- 

Theorem X. The preceding Corollaries III and IV may be 
restated, for all values of p^. 0, with the multiplied’ Ap replaced 
by an absolute constant D, depending neither on p nor on 
anything else. 

4. Degree of convergence of Legendre series 

A considerable part of the above reasoning can be carried 
over to the case of Legendre series, though the relations 
are less simple than for Fourier series, and the results as 
presented here will be less complete. 

By the Legendre series for a given continuous function flpc) 
is meant a series of the form 

Oo X 0 (sc) -4- «i Xi (sc) 4- a 2 X s (x) H- 

where Xk(sc) is the Legendre polynomial of theftth degree, and 
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Not to work out the theory of these polynomials from the 
beginning, it 'will be assumed as known that one of them 
is defined for each integral value of & 0, the first two 
being X 0 = 1, X t = x\ that they satisfy the relations 

J^X/OO X k (t)dt = 0 (j * k), J\Ut)dt = y£+'i' 

that any successive three of them are connected by the 
recursion formula 

(fc + 1) X, £+ i (x) — (2A- + l)x X k (x) + kXk-xix) = 0; 

and that the polynomial of the Mh degree can be expressed 
in the form 

Xk(x) — £ i x +* (1 — sc s ) 1/2 cos <p]* d<p. 

In the last expression, the presence of imaginaries is only 
superficially apparent; if the integrand is expanded by the 
binomial theorem for a positive integral exponent, the co¬ 
efficient of each odd power of i after integration is an integral 
which is seen at once to be equal to zero. 

Let 8 n (x) stand for the sum of the first n-f-1 terms of 
the series: 

Sn (x) = do X 0 (x) -j- cii X% (x ) -J- •••■+■ Un Xn (a?). 

By the definition of the coefficients, 

Sn(x) = t)dt, 

where 

K n (x, f) = y [Z 0 (x) X 0 (t) + ZXi(x) X x (0+ • • • 

- ^(2n+l)Xn(x)X n m. 

This function can be rewritten in the form 


K n (x, t) 


Xn-f-i (x) Xn (t) Xn (x) ( t) 

2 x—t 
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the identity is immediately verified for n — 0, and may then 
be proved in general by a straightforward process of induction, 
based on the recursion formula. 

In the integral expression for Xk(x), when [a- 1 <j 1, 

| as+«(1 — as 2 ) 3 -/ 2 cosy | = [* 2 + (l— a; 2 ) cos 2 ?] 1 ' 2 

S l^+U—as 2 )] 1 ' 2 = 1, 

and hence | Xk(x) | 1. There will be occasion to use a closer 

inequality for | Xu | in the interior of the interval (—1, 1). 
To return to the integral representation, 

| (as) i ^ - C n [as 2 + (1 —a- 2 ) cos 2 y]** d y 
71 Jo 

o C m2 

== — I [as 2 + (1— as 2 ) cos 2 y] &/3 d y , 

TZ U 0 

the last equality resulting from the fact that cos 2 y — cos* (rr —y). 
Since 

as 2 +(l— as 2 ) cos 2 y = cos 2 y + as 2 sin 2 y — 1—(1—as 2 ) sin 2 y. 

and since sin y ^ 2 y / n throughout the interval of integration, 
while 1—as 2 is positive for the values of as under considera¬ 
tion, it follows that 

as 2 H- (1 —as 2 ) cos 2 y 1- %- (1 —a: 2 ) y 2 = 1 — 5 2 y s , 

TV 

if 5 = 2(1 —as 2 ) 1/2 /jr. By an application of the extended 
mean value theorem to the function e~ y , 

e-y = i-y+\y*e~ e y, o<e<x , 

so that e~ y i> 1 —y for all real values of y, and in the 
present connection 

1 — $V < 

So 

I Xk&) | < = ifo e ~ kit<P ' lZdtp ’ 
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and, by the substitution u = (£Zr) 1/2 

03/2 C°° Ci 

I -2fc (*) | ^ n Jo e_14 =: 7,-i/2(i a;*) 1 / 2 0), 

where Ci is independent of k and x; the numerical value 
f>! = (tt/2) 1/2 is not essential for present purposes. 

From the relation just obtained it follows, first, that if x 
is restricted to an interval — i + if 1— hv > 0 <y <2, 

then 

1X^)1^-^, 

where g is independent of 7c and x, but depends on »?; and 
secondly, that 

J\j-S*(*)l dx ^ 'jp'Xi (1 — cc 2 ) 1 / 2 = P*" 5 

where Cs is independent of 7c. 

It is possible now to proceed to the proof of the following 
lemma, which assigns an upper bound for |/ 9 «(aj)|, not through¬ 
out the entire interval — 1 < x <£ 1, to be sure, but through¬ 
out an interval interior to it: 

Lemma. If f(x) is hounded and mtegrahle (in the sense 
of Biemann or in the sense of Lebesgue) for — 1 < x <: 1, if 

!/(*) I ^ M 


throughout the interval, and if S n (x) is the ’partial sum of 
the Legendre series for fix), then 

|&»te)| < GMlogn' 

\ t , 
for — l-j-ij <x<l — n, 0 < ^ < 1, where G does not depend 
, on x, n, or the function fix), hut does depend on n. 

The proof starts from the. fact that 

1 8 nix) I < M I Unix, i) | d t. 


Let the interval of integration be divided into five sub-intervals, 
terminated by the points 
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Under the hypothesis that — 1 + ^ ^ ^ ^ 1 — y, these points 
will succeed each other in the order named, as soon as 
n'> 2 /y. It will be assumed for the present that this con¬ 
dition is satisfied; then, in particular, w=£0, and as there 
will be occasion to observe incidentally, [(«-)- l)/»z] 1/2 < 2 . 
Let the values of the integral of | 2 T»| over the sub-intervals be 
denoted by h , • • •, la respectively. The relation | X] C (t) | < g/ 7 c 1! - 
can be used in the second, third and fourth integrals, and 
the relation \Xk(oc) | < g/h 112 , which is independent of t, in 
any of the five. In the middle interval, 


KAx, t)\ = 


2 \&k+i)x 1 c {x)xat) 

h —0 & 


= 2 + Ji2 x ) • ^2 * 

< 1 + S 2.? 2 = 1 +2y 2 « < (2y s +1) n, 

k~l 

and hence 


01 « <L,,„ « f+nmt - 2i2.»*+1). 


The representation of K n (x, t) as a fraction is to be used in 
the remaining integrals. In the first interval, \x —#| 
and 


—l-f. * yj 

*!=£* 2 \Kn(*,t)\dt 


% 7 > \X n {t)\dt 

j Xn (*) | I (0 I ^ ^ 

^ n ~y 1 [(%+i) i/a \ dt+ Xi ^ i d *] 


s -i+t 7 


] 


n -j- 1 2 Caj? - ^SlR. 

— n ' [«(w + i )] 1/2 n 
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In the same way, I 6 < 4 cc 2 g/'ri. In the second interval, 
I Xh-X (») Xn (<) — Xn (ce) Xn+1 «) | 


»+l 8 

2 * [n(n-\-1)] 1/2 ^ *9 ’ 


= 2 


so that 


X ao—l/n px—lfn , 

\KA*,t)\it<1tg'\ - 2 ^, 

-14-t/—14-yi? * ® 


or, by the substitution x—t 


I*< 2 g* 


du 


<2 g' 


1 f 2 du_ = 

eJl/tt W 


2y 2 (log 2 + log n). 


Similarly, J*< 2 y 2 (log 2 -(-logw). As n is an integer satis¬ 
fying the condition n > 2A; > 2, it is certain that n ^ 3, 
log n, > 1, and the inequalities that have been obtained for 
It, ••*, will merely be strengthened if the factor log n 
is inserted in the right-hand members wherever it does not 
occur. By combination of these inequalities, 


J_ x I K n Qr, t)\dt <j <?i log n 


for n’> 2 /tj, the number G x depending only on «?. For each 
value of n belonging to the range 2 < n <[ 2/^, the integral, 
considered as a function of x for —X —t- -ay < a? ^ 1— rj, has 
a maximum value. Let G s be the largest of the finite 
number of maxima thus determined. Then G a depends only 
on ij, and the statement of the lemma is true for all values 
of n ^ 2, if G is taken as the larger of the numbers Gt, 
G a /\og 2 . 

For the application of the lemma, it is to be noticed that 
an arbitrary polynomial of the «th degree can be expressed 
identically as a linear combination of Xo(jx), ••*, X n (x ), 
with constant coefficients. Then a process of reasoning 
which is entirely analogous to that used in the proof of 
Theorem IX, and which need not be repeated at length, 
serves to demonstrate 
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Theorem XI. If f{x) is a continuums function for 

— 1 < aj ^ 1, and S n ix) the sum of the first n +1 terms 
of its Legendre series, n> 1, and if there exists a polynomial 
P n (a i), of the nth degree, such that 

1/0*0 — Pn ix) I <; s n 

for — 1 < x < 1, then, for — 1 +q < x < 1 —ij (//> 0 ), 

I fix) —£»(£&) I <: Se n log n, 
where EL depends only on y. 

The more specific results obtained by comb ining this prop¬ 
osition with Theorems V—VJLLl will not be formulated sepa¬ 
rately, but will be summarized in a single 

Corollary. If fix) has a continuous pth derivative ip > 0) 
for —1 < x 1, with modulus of continuity w(d), and if 
S n 0*0 is the sum of the first n- f-1 terms of the Legendre 
senes for fix), then 

| fix)-Sn(x) logn, 

for n~> i) and for —1 + >/ <! # < 1— n, where Up de¬ 
pends on ij and on p, hut not on anything else; m parti¬ 
cular, the series converges uniformly to the value fix) for 

— 1+^ a? < 1 — n, if fix) itself has a modulus of con¬ 
tinuity ooid) for — 1 ^ x < 1, such that lim a>(d) log d = 0. 

~ ff=0 

Since ^ may be arbitrarily small, the last conclusion implies 
that the series is convergent, not necessarily uniformly, 
throughout the open interval —l<as<l. 

Gronwall(Mathematische Annalen, vol. 74 ( 1913 ),pp. 213 - 270 ; 
Transactions of the American Mathematical Society, vol. 15 
( 1914 ), pp. 1 - 30 ) has shown essentially that 

J^jXnOr, t)\dt Gorib 

for —1 < x< 1, where G a is an absolute constant. It* follows 
that conclusions analogous to those of the preceding theorem 
and its corollary hold for the entire closed interval (—1,1), 



32 


THE THEORY OF APPROXIMATION 


if the factor logw is replaced by n 1/2 , the corresponding 
sufficient condition for uniform convergence being that 
lim^Q 00 (d)/S 1 ^ 2 = 0. But the proof of the relation of in¬ 
equality for the integral appears to be rather long, and will 
not be set forth here. From the definition of K. n > together 
with the facts that | X& (pc) ] <j 1 for — 1 <S x 1 and that 
J* | Xk(t)\dt < c 2 /k 1/2 , it is obvious that 

X 1 \K n (oc, t) | dt < 1 + 4 2 (2 ft +1) Ci/k l,a <Z c a n al2 , 

where c 8 is an absolute constant; and it can be inferred at 
once that the series converges uniformly to the value f(pc ) 
for —1 < 1, if five) has a first derivative with a modulus 

of continuity o-(d) such that lim^Q co(S)/3 1 ^ = 0, while 
there are corresponding theorems on degree of convergence. 
In a later chapter, conditions will be obtained which are 
closer than those thus indicated, though not so close as the 
ones corresponding to the factor n 1/2 . 

From the discussion of approximation in terms of polynomials 
and trigonometric sums it is natural to pass to similar questions 
with regard to developments in series of more general functions, 
particularly the characteristic functions defined by linear differ¬ 
ential equations with boundary conditions. The beginnings 
of such a theory have been presented by the author (Trans¬ 
actions of the American Mathematical Society, vol. 15 (1914), 
pp. 439-466) and W. E. Milne (the same Transactions, vol. 19 
(1918), pp. 143-156). The present account, however, will 
be continued along other lines. 



CHAPTER II 


DISCONTINUOUS FUNCTIONS, FUNCTIONS OF LIMITED 
VARIATION, ARITHMETIC MEANS 

Introduction 

The discussion hitherto has been concerned almost entirely 
with uniform convergence, and with functions that are con¬ 
tinuous throughout the interval under consideration. Corres¬ 
ponding theorems with regard to the approximate represen¬ 
tation of discontinuous functions are naturally less simple, 
and perhaps of less immediate interest. It will be well, 
nevertheless, not to disregard entirely the question how far 
tbe theory that has been outlined can be brought to bear 
on the representation of such functions. This question will 
occupy the present chapter. (For a somewhat different set 
of theorems on the approximate representation of discontinuous 
functions, reference may be made to a paper by C. E. Wilder, 
in the Rendiconti del Circolo Matematico di Palermo, vol. 39 
(1915), pp. 345-361.) There will be occasion, also for the 
further development of the theory as applied to continuous 
functions, by reference to the concept of limited variation, 
and by study of the summation of Fourier series according 
to the method of the first arithmetic mean. A beginning 
will be made by a review of some well-known general theorems 
about Fourier series. 

i. Convergence of Fourier series under hypothesis 
of continuity over part of a period 

Let y(a;) be a function of period 2 tc, which is sqmmable 
over a period, together with its square. (Without change in 
the form of the argument, the discussion can be kept elementary 
for the present, with results which still have a high degree 
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of generality, if it is assumed instead that f(x) is bounded 
and integrable in the sense of Biemann.) Let 

1 rn 1 C n 

a k = — I f(t) cosktdt, lie =— I f(t) &mlct dt, 

71 rJ—71 71 J—n 

n 

S n (x) — eosJix+bkSinkx). 

7c =1 

Then, as may be verified by multiplying out the square and 
integrating term by term, 

— f [f(x) — 8n{x)\~dx 
ft J—7Z 

=-^r ( [f(x)]*dx—-- ( f(x)Sn(x)dx+^- f [S n (x)Ydx 

ft U—n ft —7i ftv—n 

- [| +J: («£+*©] 

+[i+£ «+*] 

=[l+i!«+*]■ 

As the first member can not be negative, it must be that 

Ik) < — P [y(aj)]*cZa: 

for all values of n, and hence that the series 

(«*+&!) 

6 7c=1 

is convergent. It follows that for any function f(x) of the 
character specified, the coefficients a k) bu approach zero 
as k becomes infinite. The identification of the sum of the 
series with the value \_f(x)Ydx is not needed for 

present purposes. 

The assumption that [/(a;)] 8 is summable is not essential 
to the truth of the conclusion. On the hypothesis that/(a;) 
itself is summable, let /n( x ) be the function which is equal 
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to /(a) when | f(x) | < X, and equal to 0 when \f(ac) \ .> X. 
Let a^, bk be the Fourier coefficients of f{x), and aux, bus 
the corresponding Fourier coefficients of fx(x). It is known 
from the earlier work that lirn^oo a/cjv = 0 , linu, =« fox — 0 , 
for fixed X, since fx ( x ) is bounded. Let e be an arbitrary 
positive quantity, and let X be taken so large that 

/*(*) I dx S\ e - 

The hypothesis of summability implies that such a choice 
of X is possible. Then 

1 C n 1 

— I | \Ax) — fx(x)\ cos kX I dx < S , 

7T t /—71 £ 

—r r I [/(a?) —fx 0*03 sin kx | dx < ~e, 

7* t /—Jl * ^ 

for all values of k, and hence 

| I bk — foN\<Y e - 

But for the particular value of X in question there is a k 0 
such that |a*2r| <1 ^ for k ^ fe 0 > and for 

values of k subject to the last condition it follows that 

1 a *l ^ € ) I M ^ *• If f(x) is a summable function of period 

2 it, and zf a,k, bk are zts Fourier coefficients, 


lim an — 0, lim fo = 0. 

Jc =oo 7c =oo 


Now let/Oe), still summable and of period 2 nr, be supposed 
to vanish identically for x 0 — n x < x 0 -\-y, (X^Cnr. 
As was shown in the first chapter, 


Sn (x 0 ) = -i- 



sin (n + $) (t — x q) 
2 sin ^ (t—& 0 ) 


dt . 


Let t —xq = u\ inasmuch as the integral of a periodic 
function over a period is the same, wherever the initial 
point is taken, the limits — nr, nr may be retained after the 
substitution: 
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Sn (%o) 


= 4 f 


ft U —71 


/(*•<, +it) 


sin (n + i) it 
2 sin i u 


du, 


or, after expansion of sin (mi + $w) by the addition theorem, 


1 C n 1 

S n (x 0 ) — x — I / G*o + «) cot -5- u sin nu du 

&Tt J —71 * 

1 r n 

4- -x — I f(x 0 -fti) cos nudu. 

&TT J—71 

The factor cot \u is bounded and continuous over the range 
of integration, outside the interval where /(a? 0 + u) vanishes 
identically. Hence f(x 0 -f-tt) cot^w, like / (a%-f-«) itself, 
is a summable function (or a bounded function integrable in 
the sense of Riemann, if this hypothesis was originally im¬ 
posed on f(x)), and the reasoning of the preceding para¬ 
graph is directly applicable to show that both integrals in 
the last expression approach zero as n becomes infinite, 
that is, 

lim S n (x 0 ) — 0. 

?&= OO 

More generally, if f(x) and y(x) are two summable functions 
(or two functions satisfying the alternative hypothesis), if 
8 n(x) and s n (x) are their respective Fourier sums of the 
wth order, and if f(x) and y (x) are identically equal for 
Xq— ij < x < x 0 +V) then 


lim [S n (x 0 ) — s n (a*)] = 0 ; 

n—00 

if the Fourier series for y (x) converges for x — Xo to the 
value y (x 0 ) — f(x 0 ), the Fourier series for f(x) does the 
same. This is commonly expressed by saying that the con¬ 
vergence of the Fourier series for a given function at a speci¬ 
fied point depends only on the behavior of the function m the 
neighborhood of the point; it is tacitly understood that only 
functions of some specified class are considered, as, in the 
present connection, summable functions, or functions which 
are bounded and integrable. 
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It is possible then to pass immediately from Corollary Ila 
of Theorem IX in the first chapter to the following statement: 

Theorem I. If f(s£) is a summable function of "period 2 n (or, 
less generally, a function of period 2 tz which is bounded and 
integrable in the sense of Kiemann), and if there is an inter¬ 
val (x 0 — V, + throughout which fix) is 

continuous, with a modulus of continuity <o (S') such that 
lim^Q 00 (d) log 8 = 0 , then the Fourier series for fix) con¬ 
verges for x — x 0 to the value f (pc 0 ). 

For a periodic function <p (x) can be defined as equal to 
fix) in (a?o — q, xo + q), and linear, say, from + to 
x 0 — t} + 2 7 t, and this <p {x) will have a modulus of continuity 
satisfying the requirements of the Corollary cited. 

Attention will next be directed to the simplest case of 
convergence at a point of discontinuity. Let fix) now be 
a function which has a “finite jump”, or discontinuity of the 
first kind, at the point x — ocq, approaching limits which may 
be denoted by f(x 0 —) and f(x 0 +) as x approaches x 0 from 
the left and from the right respectively. Let it be supposed 
that the values of fix) in the interval x 0 — q<^x<.x 0 , to¬ 
gether with the value fix o—), form a continuous function 
with modulus of continuity (o t (d), and that fix) is likewise 
continuous, with the other limiting value at x a , in the interval 
from ocq to Xo + n, the modulus of continuity this time being 
« 2 (d). For each value of 8 , let a>(d) represent the larger 
of the numbers oo x , « 2 . It will be said briefly that fix) is 
continuous for a? 0 — V ^ x < x a -\-ij, ivith modulus of con¬ 
tinuity except for a finite jump at the point x 0 . It is 

understood always that fix) is summable over a period. If 
lim^o^OS) log 8 = 0 , Theorem I establishes the convergence 
of the Fourier series for fix) at all interior points of the 
interval (jc 0 —*?, xofi- q), other than x 0 , and it remains to 
consider the question of convergence at the point ar 0 itself. 

For this purpose, it may be assumed without loss of gene¬ 
rality that x 0 = 0 . For the terms of the series which involve 
cosand Sinn#, taken together, are identical with the 
corresponding terms in the development of fix) as a function 
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of as — r 0 . In formulas, let x — x 0 — y, /(a?) — sp (y), 
t — xq = u . Then v — y — t — or, and 


£ r*7i 

5P (it) eos n n d n + sin n y J_ n <p (it) sin n u cl u 
(p (w) eos n{u — y) clu = ) cos n(t — o') dt 

On 

= cos nx cos n t dt + sin n x \ n f(t) sin ntdt. 


The assumption that x 0 = 0 is therefore equivalent to a change 
of variable which does not affeet the conditions of the problem. 

With this understanding, let the symbols/(a 0 +) and f(x „—) 
be replaced by /(0+) and /(0—), let 

M x ) =f(x) ( 0 <x< tx), .A i x ) =fi(— x) ( — Tr^rcCO), 

MO) =/( 0 +), 

and let f s (x) similarly be an even function identical with f(x) 
for — n < x < 0 , and taking on the value /(O —) for x — 0 . 
Let Sni (%) and S n2 (x) be the partial Fourier sums for f x (x) 
and f 2 (x) respectively. The function J x (x) is continuous 
throughout the interval — V x ^ V • If Xx and x a are two 
numbers belonging to this interval, and if \x a — x x \ < 8 , 

|/i(asa)~-/tOri)| < ©(d) 


if sci and have the same sign, or if one of them is zero, and 

l/ife)—/iM < I fi fo) ~/i (0)| + |ft (0) -f x (x x )[ <^*2©(d) 

if Xi and a* have opposite signs. Similarly, f a (x) has a 
modulus of continuity < 2 ©(d) for — y < x y. If it is 
supposed that lim^o ©(d) log 6 = 0 , 

lim S n i (0) =/i(0) = /(0 +), lim S„ 2 (0) =f a (fi) =/(0~), 

»~O0 ?J-=00 

by Theorem I. But 
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Sn l(0): 

s n m- 

8 n ( 0 ): 


J_ C 71 f , A sinp + ^H 

2 sin 4 1 

_L C 71 „ tA sin(?t + ^£ 
nJ-jr*' 7 2 sin £ t 

sm(w + ^)^ 
2sini# 


4J>> 


dt ■ 
dt- 


dt ■ 


AT A0 Si<z±M'j, 

xJo jw 2sin£f 
JLJ 0 jasmin+ ls)t rl4 


2 sin£f 


[8,a (0) + 8ni (0l]. 


and therefore 


lim S n ( 0 ) = [/(0 +) +/(0 -)]. 

1 = 00 ^ 


The conclusion may be appended to the theorem as 

Corollary I. If f{x) satisfies the conditions of Theorem I. 
except for a finite jump at the point x Q , the series converges 
at Xo to the value ^ [fixa +) -f- f{x a —)]. 

So far, nothing has been said about uniformity of con¬ 
vergence. To lead up to a discussion of this topic, the 
following lemma will be established: 

Lemma I. If fix) is an arbitrary sammable function, the 
integrals 

A k (y) = J^ n f(.t) cos ktdt , Bkiy) = J\f(f) sinkt dt. 

approach zero, asJc becomes infinite, uniformly for —tc -fy 
If fix ) is bounded and integrable in the sense of Riemann, 
the proof holds without change of form when the integrals 
are thought of as Riemann integrals. 

In the first place, it is clear that 

linifc =0 o^.fc ip) = limfc=oo2?&(y) = 0 


for any fixed value of y in the interval. For Ajc(y)/n and 
Bk iy)Jnc are the Fourier coefficients of a function ip ( x ) which 
is equal to f{x) for — n <, x y, and equal to 0 for 
•y<Lx < it. Let e be any positive quantity. Let 

Aiy) =p_ n \fif)\dt. 

This function is continuous for — n <j y < n, and so 
uniformly continuous. Let <5 be a positive number such that 
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\Ai,y i )—A{yi)\<-^re 


if y t and y% are any two points in (— tt, tv) for which 
\y & — yi|<d. Let N be an integer, for definiteness the 
smallest integer, such that 2 n/N<L 8 , and let Zj = — jt-\-[ 2 jnlN), 
j == 0 , 1 , • • •, N. By the remark made above as to the 
behavior of Ak (y) and Bk (y) for fixed y, there is for each 
value of j a number kj such that |- 4 fc 0 j)| <■£«, \Bk(?j) \ <.\t. 
for k ^ kj. Let k' be the largest of the numbers k 0 , • • •, kn. 
If y has any value in .the interval (— tv, tt), there is a zj 
such that 0 <; y — Zj<d. Then, for any value of k, 


! Ak (y) — Ak (zj) | -= IJ> eosA*^d!^| r 1/(0 1 d 

= A(y) — A(zj)<-je, 
\B h (y)-B k (zj)\<±-e. 


If k > k', 

\Ak (y)\ ^ \A k fa )| + |-i/c (y) — Ak fe)l <«, I Bk {y)\ <e. 


This is equivalent to the conclusion of the lemma. 

The existence of the period 2 tv for f(x) being understood 
throughout, let the definition of A k (y) and B k (y) be extended 
to all real values of y. For any y x and y s , 

I /(<) cos lctdt = Akiyt)— Ajc(y%). 

Here and subsequently, each relation written down for A k 
has a counterpart involving Bk. Let e k be defined for each k 
as the larger of the maximum values attained by \Akiy)\, 
|-B*(y)i in (— tt, tf). In particular, \Au{tv) | <= e n, 

The lemma just proved is equivalent to the statement that 
limfc=o<> = 0. Because of the periodicity of f(t) cos Jet 
and fit) sinfcf, the integrals defining Ajc(y) and B k (y) are 
not altered in value if both limits of integration are increased 
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or diminished by the same integral multiple of 2 nr. For 

example, if y is in the interval (n, 3 nr), 

\ A k (y) — Ak(n) { < e k , 

whence it follows that | A k (y) | ^ 2 e k in this interval. If y t 
and 2/ s are anywhere in (— n, 3 nr), the corresponding valnes 
of Ak will differ by not more than 4 e k . Finally, if and 3/3 are 
any two numbers subject to the condition that \y & — y x \ <1 2nr, 
there will be an integer j (positive, negative, or zero) such 
that y x -\- 2 jn and yzA~ 2 jn r belong to the interval (—nr, 3 jt), 
and 

\AiAyz) — -IfcO/jl ^ 4«t. 


A similar relation holds for B k . 

The purpose of these details is to bring out a fact which 
will be used presently. For any cc , and for any y in (— n, n), 


! /(0 sin n (t as) dt 

= | tosnx[£n(x-\-y)—B n (x— nr)] — sin nx [A»(a;4- y)—A n (x — nr)] j 

or, by the substitution t—x — u, 

| J ** fix + u) sin nu du <1 Se n 


for arbitrary x and y, subject to the condition that —n < y 
< nr. (It is clear that the interval from — nr to y may be 
replaced by any other interval of length < 2 nr.) 

The next stage of the discussion may be summarized in 
another lemma: 

Lemma II. If j (x) is u summdble function which vanishes 
identically for a—ij ^ x < ft-fy, with j?>0, its Fourier 
series converges uniformly to the value 0 for 

The proof will be expressed in terms of the Lebesgue 
theory of integration, and, unlike those that have gone before, 
would have to be appreciably modified in form if reference 
to that theory were to be eliminated. A more elementary 
method of proof appropriate to the case of functions bounded 
and integrable in the sense of Riemann will be indicated later. 
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Let x have a value belonging to the interval 
By the usual formula, 

, n if 71 f(i\ sin(w+ i) (f — x) , 

2sini (F=xT dt 

=( y(0cot-~(i—x)sinw(f— x)dt 

A™ J-71 * 

+ 77— r f(t) cos n(t — x)dt. 

&n J-u 


The second of the two terms making up the last expression 
is equal to £(o« cos nx-\- b n sin nx), and approaches zero 
uniformly for all values of a;, as an immediate consequence 
of the fact that an and l n approach zero. 

In the first term, let t — x — u. Then the integral is 
equal to 

J n (x) = £ n f(x + u) cot —u sin n u cl u . 


Let C(u) be a function of period 2n which is equal to 
cot \u for — tv < xi < — 7j and for y u < n, equal to 0 
for — it] u < \rj, and so defined for —y < u < —$y and 
for iv ^ V as *o have a continuous derivative every¬ 

where; it is scarcely necessary to write down an explicit 
formula for the construction. As f(x-\- u) is identically zero 
for —y <u<n, when jc is in (a, /?), the integral J n (a?) is 
the same as 

+ u ) 0(u) sin nu du. 

Let 


/"„/(* + ») 


sin nudu — F(y)\ 


the values of n and x are for the moment to be regarded as 
fixed. The function F{y) has almost everywhere a derivative 
equal to f(x-\-y) sin ny. Consequently, as C(u) is differ¬ 
entiable everywhere, the function F(u) C{u) has almost 
everywhere a derivative equal to F' (m) C(a) + F(u) C' (u ). 
Furthermore, F(u) is absolutely continuous, and C(u), having 
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a bounded derivative, is absolutely continuous also. Hence 
F(u) C(u) is absolutely continuous, and its increment over 
an interval is equal to the integral of its derivative over 
the interval. That is, in particular, 

Fin) C(n) — F( —jt) <7(— n) 

— F' (u) C(n) du-\- F(ii) C (it) da, 

Jn(cc ) — F' (it) C(u ) du 

= F(n) C(n) — F(—n) C{—u) — J^F(u) C(u) du. 

But C(n) — C{ — ti) — cot (± i n) — 0. By an earlier 
paragraph, \F(u)\ < Se n for —n < n < tc, where e n is in¬ 
dependent of a:; if ft is the maximum of the continuous func¬ 
tion | C r (u) |, which is independent of x and n, 

\Jn(x)\ — | C"(tt) du < 16 7t ft €„. 

So J»(a5) approaches zero uniformly for the values of a; in 
question, S n (x) does likewise, and the lemma is proved. 

It follows at once that if /( x) and y (x) are two summable 
functions, identically equal for a —»? < a: /S +»?, and if 
the Fourier series for <p (x) converges uniformly to the value 
9>(«) = /(*) for a the series for fix ) does the same. 

In particular, Theorem I may be further supplemented by 

Corollary H. If fix) is a summable function of period 2jt, 
and if there ts an interval a — < a; < /S -j- >• 0, through- 

out which fix) ts continuous, ivith a modulus of continuity »(<J) 
such that limrf_ 0 «(d) log d = 0, then the Fourier series for 
fix) converges uniformly to the value fix) for a < x P. 

3. Convergence of Fourier series under hypothesis 
of limited variation 

As the next pages will be concerned largely with functions 
of limited variation, the insertion of a proof of the theorem 
about the convergence of the Fourier series for such a function 
is not out of place. 
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Let fix) be a function of period 2 nr, having limited 
variation over, any finite interval. Since a shift of the 
origin from which x is measured does not change either the 
character of the function or the terms of its Fourier series, 
when sine and cosine terms of like order are taken together, 
it is sufficient to consider convergence for x = 0 . 

Since fix) is of limited variation, its discontinuities, if any, 
are finite jumps. Let ip{x) be a function of period 2 n which 
is linear for 0 <.x< 2 n, and further defined by the con¬ 
ditions y>( 0 +) =/( 0 +), Ipi 0 —) = Ip( 2 n—) =/(' 0 —), 
ip ( 0 ) — £ [/(0 +) +/(0 —)]. If fix) is continuous for x — 0 , 
ipix) is merely a constant. If fix) is discontinuous for x = 0 , 
ipix) is equal to the constant 4 [/( 0 +)+/( 0 —)] plus an 
odd function, and its Fourier series consists of this constant 
plus a seines of sine terms, all of which vanish at the origin. 
So the partial sum of this Fourier series not merely approaches 
ip ( 0 ) (in accordance with Corollary I above) but is always 
exactly equal to ip(Q). As fix) — ip (aj) + [fix)—ip (x)], 
the problem of convergence for fix) reduces immediately to 
the corresponding problem for the difference fix) — ipix). 
which is continuous at the origin, if fix) is defined for 
x = 0 as equal to the mean of its limiting values. There 
is no loss of generality therefore in assuming at the outset 
that fix, ) is continuous at the origin and vanishes there. 
This assumption will be made henceforth. 

By the hypothesis of limited variation, fix) can be ex¬ 
pressed for —n < x ^ tc in the form fix) — (as)— 9 >s(a 0 , 
where <jPi and are bounded and monotone increasing 
throughout the interval, are continuous wherever fix) is 
continuous, for as — 0 in particular, and vanish for sq — 0 . 
For the study of the expression 


let 


SniO) = 


L f f(f\ sin(n-H)* 
nJ-n J 2 sin £ * 


dt, 


hi 



sin jn-\-\)t 
2 sin £ t 


dt , 
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let jxi be the integral similarly formed with in place of yi, 
and let jn and j aa be the corresponding integrals from —n 
to 0. Then 5^,(0) = (1 / n) (j xl — Jss)- 

If £i and ? a are any two numbers belonging to the interval 
(0, n), and if n is any positive integer. 


r £ » sin(n+^ di 
J% x 2 sin 1 1 


< Btt. 


For the integrand is alternately positive and negative over 
intervals of length the arches of its graph dimin¬ 

ishing steadily in height from left to right. The integral 
over an interval corresponding to any number of whole arches 
is therefore a sum of diminishing terms of alternate signs, 
and its magnitude does not exceed that of the largest term, 
namely the area of the first arch involved. The integral 
from any & to any ? a is made up at worst of such a sum, 
plus the integrals corresponding to parts of two other arches. 
But the integrand, being equal to £4" cos t-\- ••• + coswf, 
never exceeds n-\-\ in absolute value, and the area of any 
one arch, with base can not exceed n. So the 

magnitude of the integral from ?i to ? 2 can not exceed 3 n. 
(It is almost as easy to see that the value is actually less 
than 2n, and a still lower bound could be obtained, but the 
inequality as written is sufficient for the purpose in hand; 
the essential thing is that the right-hand member is independent 
of 5i> and n.) 

If 0<d <j < £ a <7i, the number 6 being regarded 
as fixed, 

I f*' sin (re + t , | 3 recy _ 1 

| Js x 2 sin ^ t ' I = n + £ ’ <& 2 sin 4 d' 


For the absolute value of the integrand never exceeds c#, and 
the magnitude of the integral can not exceed the sum of 
the areas of three arches, each of base tcKji + and‘of 
height not greater than c^. 

Let * be any positive quantity, and let S be chosen (by 
virtue of the continuity and vanishing of spi at the origin) 
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so that <px (d) ^ «/(6w). The integral defining may be 
taken as the sum of the integrals from 0 to d and from d 
to 7 t. The second law of the mean may be applied in each 
of these integrals. For that from 0 to <5 it gives 



sin 

2 sin \t 


dt 



sin jn + j) t 
2 sin 


dt, 


where £ has some value in the interval (0, d); by the second 
paragraph preceding, the absolute value of the integral on 
the right does not exceed 3 tv, and consequently that of the 
integral on the left does not exceed \e, for any value of n. 
For the rest of in, 



in + £) t 


sin \t 


+ y>i 



sinCn + i); 

2 sin ^ t 

2 sin $ t ’ 


the new £ being in the interval (d, tt); here spi (d) < ^ (Vr), 
and the absolute value of each integral on the right, 
by the preceding paragraph, is less than or equal to 
3 71 Gtff in -f- -^)» So 



sin (w + D* j 
2 sin ^ t 


- ’ 


which is less than as soon as n is sufficiently large. This 
means that j 1± approaches zero as n becomes infinite. By 
similar treatment of j\t, j a lf and j aa , it is recognized that 
S n (0) approaches the value 0 == f(0 ). 

For functions having the original degree of generality, 
the result may be stated in the following form: 

Theorem II. If fix) is a Junction of period 2 jx having 
limited variation over a period, its Fourier series converges 
to the value fix) at every point where fix) is continuous, and 
to the mean of the values approached from the right and from 
the Iqft at every point where fix) is discontinuous . 

If fix) is discontinuous at the origin, the details of the 
above calculation apply properly not to fix) itself, but to 
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the difference fix) — <p(x), the retention of the symbol/(a) 
having beeu equivalent to a change of notation. Let it be 
supposed now that fix) is continuous throughout. Then the 
complication just mentioned does not arise (except for the 
subtraction of a constant to make f(0) = 0, which is of 
no consequence). The functions tpi (x) and (x) may be 
taken as the positive and negative variations of fix) itself 
(in the sense that spi(cc) is the positive variation from 0 to 
x when x is positive and minus the positive variation from 
x to 0 when x is negative, <p 3 being similarly defined;. The 
use of the origin as representative of an arbitrary point x a 
in the convergence proof amounts to another change of 
notation, in connection with which gPi(d), in the course 
of the reasoning, takes the place of (x 0 + <5)— fp±(x 0 ). 

Under the present hypotheses, </>i and y 2 are everywhere 
continuous, and so uniformly continuous. Hence the choice 
of 6, if the proof is written out in terns of the general no¬ 
tation, can be made independently of x 0 - For any x 0 , further¬ 
more, SPi (x 0 + n) — <jPi (r 0 ) and <p 3 (a? 0 + rc) — <p% (x 0 ) can not 
exceed the total variation of fix) over a period, a quantity 
likewise independent of x 0 . These are the essential points 
needed to justify the supplementary assertion: 

Corollary I. If fix) satisfies the hypotheses of Theorem n, 
and ts furthermore continuous everywhere, the Fourier series 
converges to fix) uniformly for all values of x. 

If it is assumed merely that fix) is of limited variation for 
xo — i] x ■< x 0 -{-7i, and summable over a period, the 
function y (a?) which is equal to fix) for x 0 — q < x < r 0 + 1 ?, 
and identically zero over the rest of a period, is of limited 
variation over the entire period, and its Fourier series converges 
at xo to the value | [y (.t 0 +) + y(#o—)] = 4 [/(ak+)+/0s>—)], 
from which it follows that the Homier series for fix) does 
the same. If fix) is continuous and of limited variation for 
a—y < x < /£ + *?, and summable over a period, the 
function y (as) of period which is equal to fix) for 
a —tj < x <j and linear for fi 4 -n ^ x ^ 

is of limited variation and continuous over the entire period, 
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and its Fourier series converges uniformly with y (a?) for its 
sum; the Fourier series for f(x) —y(a?) converges uniformly 
to zero for a by Lemma II; and consequently the 

series for /(a?) converges uniformly to /(a?) for « ^ x < ft. 
These conclusions may be expressed as 

Corollary II. If f (x) is a, function of period 2 n, summalle 
over a period, and of limited variation for x 0 —tj ^ a?< a? 0 4 -y, 
its Fourier series converges fen' as = x 0 to the value 
i[/(*o + )+/fe*>— )]; if f( x ) iS of period 'In, summalle 
over a period, and continuous and of limited variation foi' 
a — tj a: < y#-j- its Fourier series converges uniformly to 
the value /(as) for a x ^ jS. 

It may be noted in passing that the second law of the 
mean leads to a proof of Lemma II for functions that are 
bounded and integrable in the sense of Riemann, without 
the use of Lebesgue integration. In the expression which 
was denoted by J n (x), in the proof as given previously, let 
the integrals from — n to —y and from y to n be considered 
separately; the integrand is identically zero for —y <Lu <y. 
Since eot^w is monotone from y to n, and cot^yr = 0, 


J” 1 /(as+w)cot-|-wsinwM<fw = |cot-|-^)j* f(x-\-u) sin me du, 


where ? is a number of the interval (y, n). The magnitude 
of the last integral does not exceed the quantity Se n , which 
is independent of x and approaches zero as n becomes infinite. 
Similarly it may be shown that the integral from — n to — y 
approaches zero uniformly, and the conclusion of the lemma 
follows at once. 

3. Degree of convergence of Fourier series under 
hypotheses involving limited variation 

Attention will now be directed once more to questions of 
degree of convergence. The next theorem is a rather simple one: 

Theorem III. If fix) is a function of peinod 2n with 
limited variation, the total variation ovei' a period leing V, 
then, for n~> 0, 
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COSfJiK dx 




sin nx dx 


<: 


r 

n' 


Let <Pi (a;) and sp s (a;) be the positive and negative variations 
of fix), starting from the point — it, so that/(a:) =/(— 71 ) 
H-g 0 x(a?)— 9 a(x) } so t (— it) — y 2 (— it) = 0. Since f(n)=f( — n), 
SPi(n) — «jP 3 (nr) — 0. But SPi (n) + 9s in) = V, and therefore 
9 >i(n) = 9>aM = $F. By the second law of the mean, 



cos nx dx 



wJT' 

r(s 


cos nxdx 
smnit —sin n§ 


n 



Similarly, 

while 




| S~7t ^’ 1 ^ C0S nX ^ X 
| gps (as) cos nx dx 

J^ n fi — it) cos nx dx — 0. 


— 2 n 


Fsinw? 

2n 


So the first inequality of the theorem is obtained. A mechan¬ 
ical repetition of the reasoning for the integral with sin nx 
would lead to the expression cos n ? — cosmtc in place of 
sin n it — sin«.§, and the new expression has the maximum 
value 2 instead of 1, since cos nit ^ 0. But by virtue of 
the periodicity the integral from — it to n is the same as that 
from —it -f- [W(2 w)] to 7 r + [^/(2 m)], and if the variations 
are measured from the left-hand end of the latter interval the 
upper bound V/n is obtained once more, as stated in the 
theorem. 

Suppose now that fix) has a first derivative with limited 
variation, and let V be the total variation of f ( x) over a 
period. By integration by parts, 


r>) 

1 C* 

= — — I f (x) sin nx dx, 
n e/—7T 


4 
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whence it follows further, since Theorem III is applicable to 
the last integral, that 

| J^/(aj) cos nx dx 1 = j (x) sin nr dr ^ -| s . 


Similar reasoning applies to the integral with sin nx in 
place of cos nx. Repetition of the process of integration 
by parts leads to the following: 

Corollary I. If fix) is a function of period 2n which 
has a pth derivative with limited variation, p 22 0« and if V 
is the total variation of f lp) (r) over a period , then, for n~> 0, 



cos n x dr 


^__r_ 

^ Jfp m f-1 9 



sin nx dx I < 


T 

* 


The conclusion is really somewhat more general than the 
statement would indicate. Suppose for simplicity once more 
that p = 1. As will be seen on re-examination of the 
proof in the light of well known theorems on Lebesgue inte¬ 
grals, the essential thing is not that /'(.r) be uniquely de¬ 
fined at every point, but that f(x) be expressible as the in¬ 
tegral-of a function <jp (x) of limited variation: 


fix) = f(a)+j^ y(x)dx, 


for any value of a. This observation is of some interest, 
since the simplest functions represented by graphs with cor¬ 
ners satisfy the modified hypothesis, but not the original 
one. The generalization carries over to the applications of 
the Corollary in Theorems IV and Vb below. 

For the sake of another corollary, let f(x) be identically 
zero fora*)—^ < a; <! a*> -f- ; and of limited variation over a 
period. If &n (x 0 ) is represented once more by the formula 


1 C n 1 

Snipco) = —J /(aso + tt) cot-g- u sin nudu 


+ -g—/(a*) -f w) cos nit dit, 



II. LIMITED VARIATION, ARITHMETIC MEANS 51 

both fix a -\-u) and f(x 0 -f- u) cot \n are of limited variation, 
regarded as functions of u, and consequently | S n ix 0 ) does 
not exceed a constant multiple of 1 In. With regard to 
uniformity of convergence, let fix) be identically zero for 
«—V ^ x ^ ft f-y, the hypothesis of limited vai’iation over 
a period being kept unchanged. When « < a* 0 < ft, the 
function f(x 0 -|- u) cot is identical with f{x Q + ?t) C x (»), if 
Ci (m) is defined as equal to 0 for 0 < | u | < y, and equal 
to cot \u for rj < |u| ^ 7t. In general, if y>i (tf) and sPa («) 
are any two functions of limited variation, the total variation 
of spi (m) being and that of y a (u) being T a , and if M t 
and M s are upper bounds for | <pi (u ) | and | («)! respectively, 

the function yi (*/) («) is of limited variation, and its total 

variation does not exceed M x V s + M a T i. In the present 
instance, y x («) and sps (t<) being replaced by f(x 0 +•«) and 
Ci (u) respectively, fix 0 -\-u) has the same total variation 
over a period and the same bounds as fix), for any value of 
x 0 , and Ci (u) is independent of x 0 , so that the total varia¬ 
tion of f(x 0 -\-u)cot$u, while presumably different for dif¬ 
ferent values of x 0 , has an upper bound independent of x 0 , 
as long as x 0 belongs to the interval (a, fi). Such an upper 
bound can be calculated more specifically as the product 
of V, the total variation of fix), by a quantity depending 
only on y. (The absolute value of fix) can not exceed V 
anywhere, since every period contains points where fix) 
vanishes.) It is possible therefore to state 

Corollary II. If fix) is a function of period 2 n with 
limited variation , the total variation over a period being Y, 
and if fix) vanishes identically for «— -i\, then 

!«.(*)! <^ 

for a ^ x < /S, where 8 n ix) is fhe partial sum of the Fourier 
series for fix), and Ci] depends only on y. 

Each of these corollaries, taken in conjunction with results 
obtained earlier, leads at once to a theorem on degree of 
convergence. If p ^ 1 in the hypothesis of Corollary I, it 

4* 
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is known that the Fourier series for f{x) converges uniformly 
to the value /(sc); this follows from Theorem IX of Chapter I, 
and more directly from Corollary I or Corollary Ila of that 
theorem, since the hypothesis provides f{x) with' a bounded 
first derivative, and 

l/tes)—/G*i) I ^ AI Xi — x L I, 

if A is an upper bound for | f if) |. On the other hand, Corol¬ 
lary I, interpreted in terms of the Fourier coefficients, 
states that 

\an\S V/{nn*+'), | l n | < V/{nn^). 

Hence 

|/(a;)—/Sn (x) | = 11. {ah cos kx-\- bie sin kx) 

17c=n-1-1 

< 2 (|afc|+|6fc|) 

k=n+ 1 

<9 ’f’ — < 21 C°° du ^ VL 

== ~‘k^n+m fc p+1 = n Jn M * +1 pnn v ' 

The conclusion may be formulated as 

Theorem IV. If fix) is a function of period 2 rt which 
has a fth derivative with limited variation, p 1, and 
if V is the total variation of f (p) {x) over a period, then, for 
n> 0, 

!/(»)-«. w | 

where S n (x) is the partial sum of the Fourier series for fix), 
Q p is a constant depending only on p, and Q is an absolute 
constant : more specifically, Q p — 2/ipn), Q = Q t — 21 n. 

This result is to be compared with Theorem X and the 
corollaries of Theorem IX in Chapter I. 

Corollary II will be combined with Corollary 33 of Theorem IX 
in Chapter I, and with the theorem just obtained. In connection 
with the result from Chapter I, it is to be noted that if f{x) 
is a continuous function which is not identically constant, 
and if ®(d) is its modulus of continuity, 

lim inf > 0. 

<f=o 6 
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For if this were not the case, it would be possible for every 
e > 0 . to find arbitrarily small positive values of d such that 
for any such d, |/(a? s ) — /(rOl < e j x 3 — a\ \ , if 
x s — cc l is an integral multiple of d, since |/(a?i +-Vd)— f(x L ); 

< < s . N8, if N is integral, but for any choice of 
x 1 and 3 %, it would be possible to find values of d satis¬ 
fying the condition that «(d) <ed, and so small that j- s — a\ 
differs arbitrarily little from an integral multiple of d; and 
then it must still be true by continuity that I f(x») — f(x t ) | 

* | x a — x x |, which in view of the arbitrariness of e means 
that f(x$) — f(x x ) — 0 for all x t and x 3 , contrary to the 
hypothesis that f(x) is not constant. The conclusion may be 
stated by saying that ®(d)/d has a positive lower bound for 
values of 6 that are sufficiently small, as <a(d) itself has 
a positive lower bound over any interval not reaching to the 
origin, it appears further that «(d)/d has a positive lower 
bound over any finite range for d. 

Suppose now that fix), having the period 2 jt, is con¬ 
tinuous with modulus of continuity «(d) for a —'9 ^ ^ 

but not constant over the interval, and of limited variation 
(but not necessarily continuous) over the rest of a period. 
Let <p{x) be periodic with period 2n, equal to f(sc) for 
a—n <Lx<0-\-vi, and linear for 0 J r f i ^ x <1 a — ij-\-2rr. 
In the latter interval, the modulus of continuity of <?(x) is 
a constant multiple of d, which by the preceding remarks 
does not exceed a constant multiple of ®(d), say &«(d), over 
the range within which <»(d) is defined, namely for 0 < d 

< 0 — « + 2 ^. Ifd satisfies the latter condition, any inter¬ 

val of length d is made up at worst of an interval congruent 
(modulo 2 jt) to + « — 17 + 2 ti), together with parts ot 

two intervals congruent to (a — 17 , /£ + ??), and so g> (a?) has 
everywhere a modulus of continuity (d) which for 0 <d 

< 0 —ce -f- 2 n does not exceed Qc + 2 ) co (d). By the corollary 
cited from Chapter I, <jp ( x ) differs from the partial sum of its 
Fourier series by not more than Aco 1 (2n/n) logn, and so 
by not more than AQc-\- 2) (*>(2Tt!n) log n, if n is large enough 
so that 2n!n comes within the specified range of values for d. 
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On the other hand, f(x) — f ( x ) is of limited variation, and 
identically zero for a — ^ < x so that the recent 

Corollary II is applicable; and the quantity 1 /n which enters 
into the conclusion of that corollary does not exceed a constant 
multiple of a(2n/n). So it is possible to state 

Theorem Ya. If the function f(x), of period 2 n, is con¬ 
tinuous with modulus of continuity aid) for a — 1 j x < 

where <w(d)>0 for <$>-0, and of limited variation (bit not 
necessarily continuous) over the rest of a period, then 

\f(x) — S n 0*01 ^ coo(2n/n)logn 

for a < x < J3, if n is large enough so that 00 (2 n/n) has 
a meaning, S n (x) being the partial sum of the Fourier series 
for f(x), and c a constant depending neither on x nor on n. 

The combination of Corollary II with Theorem IV (or 
rather with a part of that theorem) proceeds with a little 
more facility. Let f(x) be a function of period 2n having a 
first derivative with limited variation for a — < x 

while f(x) itself is of limited variation over a period. 
Let <p (x) this time be of period 2n, equal to f(x) for 
a—y <L x <> 0-{-y, and defined for J3 + ij <! x < a — 
as a polynomial of the third degree so that g> (x) and its 
derivative have determinate values at both ends of the interval. 
Then $p (x) has a first derivative everywhere, which is of limited 
variation over a period, and f(x) — <p(x) is of limited variation 
over a period and identically zero for a — ij <> x < ft 
It remains to apply Theorem IV to g>(x), and Corollary II 
to /( 2 c) — gp(as), and the following theorem is obtained: 

Theorem Vb. If the function f(x), of period 2 n, is of 
limited variation over a period, and has a first derivative of 
limited variation for ct—ij < x < d-\-y, then 

\f(x) — S n (x) \ < ~ 

7b 

for a <1 x < fi, where S n (x) is the partial sum of the Fawner 
series for f(x), and c is a constant depending neither on x 
nor on n. 
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Space will not be taken liere for the working out of more 
elaborate combinations of similar character. A theorem re¬ 
lated to Theorem III, however, will be obtained for the sake 
of a subsequent application. 

It was shown early in the chapter that the Fourier co¬ 
efficients of an arbitrary summable function of period 2 tc 
approach zero as a limit. Let f(x) now be a function of 
period 2n which is absolutely continuous, and let a*,, 1>k be 
its Fourier coefficients. Its derivative exists almost every¬ 
where, and is summable. The product f(x) sin kx is like¬ 
wise absolutely continuous, and so has a derivative almost 
everywhere, and its change of value over an interval is 
equal to the integral of its derivative. Specifically, it has 
a derivative equal to f'(x) sin kx-\-k f(x) cos kx at every 
point where f'{x) exists. Consequently 


0 = 


hak — 


f (nr) sin k n —f (— n) sin (— k n) 

X 7r niz 

f (x) sin kx dx-\-h I f(x) cos kx dx, 

-7T 

-— I f (a) sin kxdx. 

TC J-7T 


But by the theorem cited at the beginning of the paragraph, 
the last integral approaches zero as k becomes infinite, since 
f'(x) is summable. Similarly, kbk approaches zero. 

It was seen at the beginning of the chapter that 

[f(x)—S n {x)Ydx 

But under the hypothesis of the moment f(x), being abso¬ 
lutely continuous, is a fortiori of limited variation, and hence 
S n (x) converges uniformly to f(x). So the left-hand member 
approaches zero, and the right-hand member must do the 
same: 


~ r [/(«)] 2 — ir +5 ( a i + 6 d * 
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(It is still not necessary for the purpose in hand to establish 
the well-known fact that the last relation holds under much 
more general hypotheses. Use of the theorem on the con¬ 
vergence of the Fourier series for a function of limited variation 
can be avoided by reference to the least-square property of 
the sums Snix), discussed in the next chapter, together with 
the fact that by Weierstrass’s theorem trigonometric sums 
T n (x) can be found so as to make fix) — T n {x) approach 
zero uniformly.) From the last two equations, taken together, 
it appears that 

— r [f(x)~Sn{xWdx = Z U + il). 

71 O—k A=»+l 


Let e be an arbitrary positive quantity. It is possible to 
choose no so that |fca*| < (■£ e) 1/2 , < (i«) 1/2 , for & l> n 0 . 

Then, if n ^ n 0} 


Z (&!+&*) ^ a Z 

■=n+l 

which means that 


< £ 


— k^n+i Jc S ~ J” U 


du 

2 


n 


limw Z («!+&£) = 0. 

n=oo k=n- j-l 


The results of the last two paragraphs may be summarized in 
Theorem VI. If fix) is an absolutely continuous function 
of period 2 n, avid zf On, b n are its Fourier coefficients, 


lim na n = 0, lim nb n — 0. 

n=oo n —oo 


If 8*(x) is the partial sum of the Fourier series far fix), 
awl if Yn is defined by the equation 

Yn = [fix) —^(m)] 8 dx, 

then 


lim ny n — 0. 

n=oo 


The first part of this theorem supplements Theorem HE: 
the products na n , nb n are bounded for any function of limited 
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variation, and approach zero if the hypothesis of limited 
variation is replaced by the more stringent requirement of 
absolute continuity. The last part of the theorem will be 
used in the next chapter. 


4. Convergence of the first arithmetic mean 

The next undertaking will be to outline a theory of the 
convergence and degree of convergence of the approximating 
functions with which the name of Fejdr is associated, the first 
arithmetic means of the Fourier series for a given function. 
The mean in question is defined by the identity 

«.(*) + 

where 8k(x), as usual, denotes the partial sum of the Fourier 
series through terms of the fcth order. 

It will be recalled that the fundamental integral expression 
for Sk(&) involves a factor sin (k 4 - £) (t — sc), and otherwise 
does not change with k. The product of the expression 

1 3 

sin -g- v sin v + • • • + sin 

by 2 sin^v can be rearranged in the form 

[1 — cos v ] + [cos v — cos 2 ul+ • • • -f [cos ( n — 1 ) v —cos nv\ 
= 1 — coswu = 2 sin 2 (nv/2). 



Application of this identity, with v replaced by £ — x, gives 



sin 2 ^«(< — x ) 
2sin*4(f — x) 


At, 


or in terms of a new variable u — — x): 


<r n (x) = 


1 

n n 




sin 2 WM 

sin 8 M 


Au. 


If (sin*«M)/(sin 2 w) is denoted for brevity by <Z>» (u), the ex¬ 
pression becomes 

(x) = +2w) O n (u) Au. 
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The tact that the trigonometric factor O n (u) in the inte¬ 
grand, unlike the corresponding factor in the expression for 
S n (r), is never negative, has important consequences, and 
materially simplifies the reasoning. If f(x) in particular is 
identically 1 , each 8k (x) reduces to the single term 1 , and 
cr„ (.r) also is identically 1 , for all values of n: 


1 


_ 1 _ sin -— ac) , 

nn J—~ 2 sm a %(t — x) C 



More generally, then, if f(x) has M as an upper bound for 
its absolute value, 

l P KI ' 2 

I ■ <tn (a?) | ^ — J _ ff/a | f(x + 2 u) I d>n (w) d n 

i n 

M0n(u) dll — M] 

if \f(x) | < M fur all values of a:, then | a n (x) | < M for all 
values of x likewise, and for all values of n. 

For any specified value of x, the identity in the preceding 
paragraph, multiplied by the quantity /(as), which is constant 
as far as the variable of integration is concerned, states that 

whence 

ff n (f) ~f(x) ~ ~ J^ /2 [f{x + 2 u) —/(«)] ® n {u)du. 


Suppose now that f(x) is summable over a period, and 
continuous for x — Xo< Let c be an arbitrary positive 
quantity, and let <5< 4/2 be chosen so that \f(x o +2u)—f{x 0 )\ 
^ is for (if| < d. Then 


~S_ d lf (xn 21( ) (») £♦« 


< 



e0 n (u)du 


i r* /2 1 

’nrv J- 3 T/ 2 T e ° n ^ d “ 


2‘ 


for all values of ». On the other hand, since 0 n (u) <= l/(sin s d) 
for 6 < (u| 5 < n/2, 
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1 

nit 



—/(a’o)] ©» 00 du 


< 


1 

n n sin* <J 



|/(a? o +2?0 — fix 0 )du; 


the last integral exists, by the hypothesis of summability, 
and is independent of n, so that the whole expression is less 
than as soon as n is sufficiently large. Therefore 
I <*n (%o) — f(xa) | O, when n is sufficiently large; <r n (ar 0 ) 
converges toward fix o). If fix) is continuous everywhere, 
a n (a?) converges toward /(a?) for all values of x, and the 
convergence is uniform. For <5 in the proof can be chosen 
independently of xo, since fix) is uniformly continuous, and 
if M is the maximum of \f(x) [, 



|/0zo + 2 u) — f(xo) j du < 2Mn, 


which is likewise independent of x 0 ,' 

For convergence at a point of discontinuity, let fix) be 
summable over a period, and have a finite jump for x = xq. 
Let fixo) be defined as the mean of the limits approached 
from the right and from the left. Let a function ipix) be 
defined by the requirements that it shall be of period 2 it, 
and linear forxo<x<xo + 2n, and that ip (x 0 +) = fix o +), 
ipixo —) — fixo —), ipixo) — fix o). For x = x 0 , each 
partial sum of the Fourier series for rpix r) is exactly equal 
to ipix 0 ), as was pointed out in substance at the beginning 
of the proof of Theorem 13. The same is true therefore of 
the arithmetic means of these partial sums. But fix) is the 
sum of ipix) and a function continuous for x = av, and the 
arithmetic mean formed for the sum of two functions is the 
sum of the corresponding arithmetic means. So the arith¬ 
metic mean <r n ix) formed for fix) converges for x — x 0 to 
the value fix 0 ). 

The results on convergence obtained thus far may be 
restated in 

Theorem VII. If fix) is a summable function of period 2 it. 
and <J n ix) the arithmetic mean of the first n partial sums of 
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its Fourier series, as defined above, a n (x) converges toward 
fix) at ire) y point ithere fir) is continuous,, and converges 
toward the average of the limits approached from the right and 
from the left at every point where fix) has a finite jump. 
If f (./■) is continuous eierywhere, the convergence is uniform 
throughout. 

It should he mentioned as an immediate consequence of the 
definition of the arithmetic mean that if Snips) converges at any 
point, o n (x) converges to the same value. In fact, if So,Si,Ss,,--- 
is any convergent sequence of numbers whatever, with limit S, 
and if <r n = (S 0 +&-j- • • • + S n -i)/n., then <s n also converges 
to the limit S. Let S — S k = Bk, and let JST he chosen so 
that ! Bj, | < \ e for k N. Then, if n > Jf, 


S — <r n = 


nS—jSo + Sxf +£Sn-i) 
n 


__ -Rb ~4~ -Si -f~ • • • 4~ Bn —i 


_ -So~f~ 


n 

Bn— i 


n 


+ 


-5.y~f~ • • * ~l~ Rnr-x 
n 


the last fraction is less than in absolute value for all 
values of n> N, and the preceding fraction, in which the 
numerator is independent of n, is less than for n suffici¬ 
ently large. 


5. Degree of convergence of the first arithmetic mean 

As a first hypothesis for the study of degree of convergence, 
let fix) satisfy everywhere the condition 

l/(afe) —find | < A|a$,—a* |. 

In the integral expression for <r n (x) — fix), 

\fix-\-2u) — fix) | <A|2w|, 

so that 

21 P 71 ' 2 Al ptofe 

\ a nix) — fix) | —— I | it | <D n (u)du = - I uQ>niu)du. 

YlTZ J—71/2 7171 JO 
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In dealing with the last expression, it is to he remembered 
that u/ sin n n/2 for 0 <C « n! 2. Let the integrals from 
0 to 1 In and from 1 In to nt2 be considered separately. 
In the former, let the integrand be expressed by the formula 


u ©» (w) 


u . sin nu 

- -sin nu - : -: 

sinw sm u 


of the three factors indicated, all of which are positive for 
0<u ^ 1/w, the first does not exceed n/2, the second does 
not exceed 1, and the third does not exceed n. Hence 


n 

o uO n (u)du <; — 
In the integral from 1/m to n/2, 


u ®n(u) 


if 


• 2 1 ^ n* 1 

-7 o —sir«)i •— < —- 

snrw n — 4 tt 


and 

C - T-X. IT = x( log f + logM ) 


It follows that the whole integral from 0 to jt/2 does not 
exceed a constant multiple of logn, and \o n (x) — f(pc) | does 
not exceed a constant multiple of (log n)/n. A slightly 
more specific statement of the result is 

Theorem YHI. If f(x) is a function, of period 2 n satis¬ 
fying everywhere the condition 

\f(xi)— /(a?i)| ^ A|ajg — xi\ t 

and o n (x) the corresponding arithmetic mean, then, for n~> 1, 

!/(*)—»„(*) | < °‘ *‘ og ” , 

where Co is an absolute constant. 

Let f(x) have a derivative satisfying everywhere the 
condition 

I f 0%) —f 6»i)l ^ i \xs—ah |“, 
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with a positive value of «. The error then can he somewhat 
more narrowly restricted. The factor <D W («) being an even 
function of if, the substitution of — it for u gives the integral 
expression for the error the alternative form 

i f — r/2 

(f n (.n— /(or) =--- [/(*—2ti)—/(«)] ©»(«) du 

ni* Jn;2 

— P [/O'— 2 it) —f(x)] «2>„ (it) du, 
nn xfc 

and a third expression is obtained by taking the average of 
this and the original one: 

i r ! '~ 

—J [/(:r+2 if) — 2/Or) +f(x —2if)] <t> n (n) du. 

By the mean value theorem, together with the hypothesis 
now in force, 

/ Or+ 2 if) -f{t) = 2 «/'(&), 

/Or-2 if) -f{x) — -2if/'(? 9 ), 

/Or + 2 ») - 2/Or) +/(* - 2 if) = 2 u [f (&) -/' (&)], 
|/0r + 2 if) — 2/Or) — 2 m) | £ 2 A 1 1 * | • | $,— & |« 

< 2 2<l+1 A | n |«-H, 

the numbers and ? 2 being in the intervals {x, x+2 u) and 
(r —2 u, x) respectively, so that | ?i—£ 2 | < 4|u|. Hence 
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which does not exceed a constant multiple of IIn, the last 
integral being convergent. In more formal statement- 
Theorem IX. If f {or) is a function of period 2 n hating 
a first derivative which satisfies everywhere the condition 

| f (x 2 ) — f (Xi) I < l j a? a — a-! i« 

with a> 0 , and if a n (x) is the coiresponding arithmetic mean, 
then 

I/M — »«W I 

it 

where. 6 « is a constant depending only on «. 

No higher order of approximation would be obtained by 
supposing/(a?) provided with additional derivatives. In fact, 
the arithmetic mean corresponding to the analytic function 
f{x) =■ cos x is cr,i (a-) = [(?i— l)/n] cos x, and the error 
is actually of the order of 1 In. 

Let f{.r) be an arbitrary continuous function of period 2 n . 
with modulus of continuity « (<5). The theorem to be obtained 
in this connection is perhaps of secondary interest, because 
a closer result is given for an important class of cases by 
the theorem following it; but it also covers cases not admitted 
by the hypothesis of the later theorem, and so is not entirely 
superfluous. The proof is an adaptation of that of Theorem II 
in Chapter I. Let <p (#) be a continuous function of period 2 n 
which is equal to f{x) for a set of values of x dividing 
a period into n equal parts, and is linear between successive 
points of this set. This sp ( x ) satisfies the hypothesis of Theo¬ 
rem VIII, with h = [<» (2 7t/n)]/{2 n/n), and is represented 
by the corresponding arithmetic mean with an error not 
exceeding a constant multiple of m(2 n/n) log w. The absolute 
value of the mean corresponding to the difference /(.r )—f (x) 
does not exceed the maximum of the absolute value of the 
difference itself, which is not greater than 2co (2 n/n), and the 
error of this mean can not be greater than 4m (2 n/n ). Hence: 

Theorem X. If f{x) is a continuous function of period 2 n, 
with modulus of continuity m{8), and o n (x) the corresponding 
arithmetic mean, then, for n~>l, 
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!/(j") — (.s) I Co (a log n ; 

irhere Co is an absolute constant. 

Let fix) be subjected to the hypothesis that 

l/fe)— /fo) I ^ X\x* — xt |“, 


with 0 < « < 1. Prom the original integral expression for 
the error, 

I*. W -/Ml < ®* <”> 4 * 

X p*r /2 

— - I m“ ®n («) <#W. 

J!!T e /0 


Let the interval of integration again be considered in two 
parts, from 0 to 1/n and from 1 In to n/2. In the former 
interval, 


«“ ® n (u) 
so that 


u a . a I sxo.nu\ 2 ~ a ^ (n\ tt 
sin® u ’ Sm nU ’ \ sin it / = ( 2 ) ’ 

«* ®n (?«) du < n 1- ®. 


w 2 -®, 


Prom 1/n to n/2, 

U a ®n (u) < 


tt® ^_ 1 _ 

sin a M = 4 ’ u 2 ~ a ’ 



U a ®n {u) du 


7 ^_ r *' 2 du 71 * r°° du 
4 Jl/M U 2 ~ a 4 J 1 /n w 2 * 


71 s n 1 -® 

4(1—«)* 


Substitution of these inequalities in the formula for the error 
gives 

Theorem XI. If fix) is a function of period 2n satisfying 
everywhere the condition 

I/0e») —fix i) | < h | Xg — x t |«, 

with 0 < « < 1, aw# «/* o n i<x) is the corresponding arithmetic 
mean , Men 

I/O*)-«.(*) i < 

lb 

where & is a constant depending only on a. 
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Two outstanding facts with regard to the arithmetic mean, 
as compared with the simple partial sum of the Fourier series, 
are that the former converges for every continuous function, 
but does not reproduce identically a function which is itself 
a finite trigonometric sum. Consideration of Theorems SI. 
Yin, and IX, in the order named, throws some further light 
on the ability of the arithmetic mean to adapt itself to irre¬ 
gularities in the function represented, and its inability to avail 
itself of an exceptional degree of regularity. The hypotheses 
of the three theorems imply that 

\f(x-\-v) — 2/(as) +/(.r — r)| < ?-t\v\ a 


with a constant ^ in each case, and values of a successively 
less than 1 , equal to 1 , and greater than 1 (the present «, 
in the case of Theorem IX, taking the place of the number 
previously denoted by « + !)• The upper bounds obtained 
for the error of the arithmetic mean have the orders respect¬ 
ively of l/n a , (log n)/n a , and n a ~ l /n a . The corresponding 
upper bound for the error of the simple partial sum of the 
Fourier series is of the order of (lo gn)/n a in each case. 

For a concluding theorem with regard to the arithmetic 
mean, let /(as) be of period 2 n, summable over a period, and 
identically zero for « — ? ^ as < >8 + ?. If as has any value 
in («, /?), /(as 2 u) is identically zero for —<:ii 
Hence 

- is {Lt + 0 Ax+iv) 

Since 0>»(w) ^ l/(sin* £ 17 ) for < | «| < it follows that 


ff»(ac)| < -r-fY— ( f / + f, ) l/( aj + 2 «)l 

1 — WyrSHrJf*? \J-7c/2 Jvjla ' 

. |/(as+2it)| du. 


= nn sin 2 ^■/] J—ir/a 


In terms of the variable t = x-\-2u, 

J^^ a [/0& + 2w)| du = -g- 1/(0! 
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the limits of integration on the right having been adjusted 
by use of the frequently applied observation that the integral 
of a periodic function over a period is independent of the 
position of the initial point of the period. If the last integral, 
which is independent of x, is denoted by J, 

^ 2«7T sin® J 

The conclusion is 

Theorem XU . If f(x) is a function of period 2 n which 
is identically zero foti' ct — tj ^ x ft -f- «?, and summdble over 
a period, the integral of its absolute value over a period being J, 
and if <fn(x) is the corresponding arithmetic mean, then, for 
a<Lx</3, 

KC*0i ^ ~~7 

where Cy is a constant depending only on ij. 

Taken with the last assertion in Theorem VII, this shows 
that if f(x) is continuous for a —n < x and summable 

over a period, the arithmetic mean associated with it con¬ 
verges uniformly for a < x < fi. The results obtained by 
combining Theorem XII with Theorems VIJUL-XI need not be 
enumerated at length. 

6 . Convergence of Legendre series under hypothesis 
of continuity over a part of the interval 

The theory set forth in the early part of the chapter can 
be carried over in some measure to the case of Legendre 
series, the discussion being kept on the same elementary level 
which was maintained in the treatment of these series in 
Chapter I, to the extent that no use is made of an asymptotic 
formula for the Legendre polynomials. The next paragraphs 
will be devoted to a presentation of the analogies that work 
out most readily, though it will be seen that the treatment 
is left incomplete in several particulars. 

Let f (x) be summable together with its square for 
—1 and let Ok be the coefficient of Xjfx) in its 

Legendre series, 
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a k = ^±±£ if (0 d*. 

Let 

^»(a;) = a 0 X 0 (x) -(- a x Xi 0*0 • -f - «»3T»(a:), 

as before. By reference to the property of orthogonality of 
the polynomials X k (x) , together with the definition of a k and 
the fact that the integral of [X&(;r)] 2 over the interval (—1, 1) 
is 2/(2 &+1), it is seen that 

Li [fix) - S n (x)Y dx =£ [f(x)] s dx 

As the quantity on the left-hand side can not be negative, 
the sum on the right is bounded for all values of n, the 
infinite series obtained by letting n become infinite is con¬ 
vergent, and 

lim a*/(2fc + l) 1 / 2 = 0, 

7c=oo 

or, as equivalent statements, 

lim aic/Jc 1 '* = 0, lim W 2 C fit) X k it)dt = 0. 

fc = 0O fc=0O ** I 

The parallelism with the case of Fourier series becomes 
clearer if the polynomials X k {x) are replaced by the normalized 
sequence {[(2&-f 1)/2] 1/2 Xifx)), or, more superficially, if it 
is considered that the magnitude of the coefficient is less 
significant than the magnitude of the general term of the 
series: it was seen in Chapter I that |XfcOc)| does not exceed 
a constant divided by Jc 3 - 12 , when x stays away from the ends 
of the interval, and consequently 

lim ajcXhix) = 0 

k=oo 

uniformly throughout any closed interval interior to (—1, 1). 

Let fix) again be summable together with its square, and 
suppose now that it vanishes identically for a*,— ' r ilfx<^ xo +«?, 
the interval (a*>— q } xo+y) being contained in (—1,1). The 
value of S n (xo) can be expressed in the form 


5 * 
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Sn(.T«) = ^^-Xn + x{Xo) £ -^~ t X n (t) dt 

Each of the quantities X„+i (•»<)). A^Oo) is less than a con¬ 
stant divided by n 112 , and the factor in front of each integral 
is therefore less than a constant multiple of n m \ the quan¬ 
tity l/(ar 0 — t) is bounded over the range where fit) is 
different from zero, and — t) therefore is summable 

together with its square; consequently, by the preceding 
paragraph, the product of either integral by a quantity of 
the order n lla approaches zero. In other words, S n (x 0 ) con¬ 
verges to the value 0. If two functions/(a?), g>(%) are identical 
for x 0 — t] x ^ xo-\-Tj, the Legendre series for their differ¬ 
ence converges toward zero at the point Xo, and the series 
for fix) and <p{x) themselves converge or diverge together 
at that point: the convergence of the Legendre senes for a given 
function at an interior point of the interval (—1, 1) depends 
only on the "behavior of the function m the neighborhood of 
the point. 

As a preliminary to a discussion of uniform convergence, 
let fix) be any summable function with summable square 
over the interval (—1, 1), and let 

Akiy) = fit) Xi.it) dt. 

It is to be shown that k l > 2 Akiy) approaches zero uniformly 
for —1 < y < 1. The proof is largely a repetition of the 
corresponding argument for the case of Fourier series, hut 
there are differences of detail which are not altogether trivial. 
For any fixed value of y, Akiy) is the same as 

^fiifiXkitidt, 

if /i(t) is a function equal to fit) for —1 t y, and 
vanishing for y < t < 1, and consequently ft 1 / 2 Ak iy) approaches 
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zero for the particular value of y in question. Let e he any 
positive quantity. Let 

J(y) = X_", [/(*>? <lt. 

This function is continuous, and therefore uniformly continuous, 
for — 1 ^ y < 1. Let 6 he chosen so that 

\A(y&) — A(y t ) \ < ^ «) 

whenever | y s —?/i | <; d, y x and y a being points of the inter¬ 
val (— 1,1). Let X be the smallest integer such that 2 /N<.S, 
and let Zj — — 1 + (2j/N), j — 0, 1, • • •, X. The points ij 
being finite in number, there is a k' such that k ll ~\Ai e (s J ) < 
for all the values of j in question, whenever k ^ k'. For 
any value of y in (—1,1), let j be that one of the numbers 
0, • • •, N for which 0 < y — zj<i2/X<.S, and let Z = Zj. 
Consider the difference 

A k {y)-A k {Z) = £f(t)X k (t)dt. 

By Schwarz’s inequality, 

[£f(t) X k (t) [/(*)]’ dt£[X k (t)Y at 

= [A(y) -A(Z)]£ [X k (t )] 3 dt 

< [A(y)-A(Z)]^[X k (f)Ydi 

< ( y *) 3 ‘ 2 /( 2 *+ 1)<(-|-^) 2 A . 

Consequently 

k^\A k (y) — A lc (Z)\<~e. 

This is true for any positive value of k. On the other hand, 
if k ^ k', k L,2 \A k <lZ )\For such values of k, therefore, 
/f^l Ak(y)\ O; and k' is independent of y. Let e k be the 
maximum of | A k (y) | for — 1 < y < 1. The conclusion is that 
limfc=oo k 112 sje = 0. 
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Now let /( x) be summable together with its square over the 
interval (—1, 1), and identically zero for a — n x < £ + 

where — 1 <« — 17 < ,£ + 7 < 1 • Consider the expression 

J —1 3" t J —1 35 2 


on the assumption that x has a value belonging to the inter¬ 
val a x < /$; the function A n (t) has almost everywhere 
a derivative equal to f(t) X„(t). Let R(u) be a function 
equal to 1 fu for |u| and so defined for \u\<rj as to 
have a continuous derivative everywhere. Then, for any x 
in (a, fi), It(x — t) is the same as l/(a5— t) at all points of 
the interval — 1 ^ t < 1 where f{t) is different from zero, 
and the integral above is equal to 

R(pc — t) dt. 

The product A n (t) It (x — t) has almost everywhere a derivative 
with regard to t, equal to An(t) R(x — t) — A n (t) R!(x — t), 
and its increment over an interval is equal to the integral 
of this derivative, so that 

An(f) R(x — t ) dt 

= [^»(<) R(x— t)]l_ 1 +J^ x A n (f) R'(x—t) dt. 


Both R(v) and R'(u) are continuous for all real values of u, 
and approach zero as u becomes infinite in either direction. 
Let M 0 be the maximum of J _K (w) |, and M t the maximum of 
|i2'(«)|; these numbers are of course independent of as. In 
the right-hand member of the last equality, A n ( —1) = 0, 
|4»(l)l ^ e n, \R(x — 1)| < M 0 , while \An(t)\ <j s n , 
\R?(pc — #)| ^ Mi, over the whole interval of integration. 
Consequently 


^A! n (t)R(x — 


t) dt 


jMo £ n 2 AL\ €} t/ , 


Since lim» =0 o n 1/2 e n = 0, 
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„■» r mz& it 

J—l x - t 

approaclies zero uniformly for a x 0 . The same con¬ 
clusion holds if X n it) is replaced hy Zn+i it). On the other 
hand, n 1/z X^+i (a?) and n 112 X n ix) are uniformly bounded over 
(a, yd). Hence, on assembling the constituent parts of the 
integral expression for 8 n ix), it is seen that S n (cc) approaches 
zero uniformly over the interval in question: 

Theorem XIII. Jf fix) is summable together toitJi its square 
over the interval (—1,1), and identically zero over an interval 
« — *i ^ x < contained in (—1, 1), and if 8 n (x) is 

the ‘partial sum of the Legendre senes for fix), then S n (x) 
converges uniformly toward 0 for a <j x ^ yd. 

With the corollary of Theorem XI in Chapter I, this yields 
at once the further 

Corollary. If fisc) is summable together' with its square over 
the interval (—1, 1), and continuous over the interval a —n 
^ x yd-}-*7, with a modulus of continuity «(d) such that 
lim^o <»(d) log <5 = 0, the Legendre series for fix) converges 
uniformly to the value fix) for a ^ x <[ yd. 

7. Degree of convergence of Legendre series under 
hypotheses involving limited variation 

It remains to consider questions of degree of convergence. 
The discussion will be based on the properties of the Legendre 
polynomials already used, together with the identity 

Xnix) = 2n 1 +1 [Xn+l ix) — Xn-i (*)]. 

Let fix) be a function of limited variation for — 1 < x < 1, 
its total variation being V. Let cpiix) and <Psix) denote its 
positive and negative variations, measured from the left-hand 
end of the interval, so that fix)—fi —1) + 91 (®)— 9aix), 
<p 1 i —1)= y a (—1) = 0. By the second law of the mean, 

SP! (a) Xn ix) dx = SPi (1) Xn ix) dx 
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for some value of £ in (—1,1). But as a consequence of 
the identity in the preceding paragraph, with the observation 
that AVfi (1) == A,i_i (1) == 1, 

J! -An («'/') dx = <2 71 \ I [A»+i A)i—1 (x)]c 

For a sufficiently close estimate of the magnitude of the 
difference in the last bracket, there is occasion to go back 
to the identity 

An (a-) = ~ JT [•=*"+ * ( 1 — -* ,s ) 1/2 cos rf" d V • 

If this formula isusedto express the difference X n -i(x) —An+i(.«j 
by means of a single integral, the integrand is 

[x + i (1 — a ?) 112 cos y] n—1 {l — [x + i (1 — a 2 ) 172 cos y] 2 } 

= [aH-/(1— a?) x,i cosy]'*-^!—a: 2 ) 1/2 [(l-a^) 1/2 (l+cos 2 (p)-2 ix cosy]. 


For x in (—1,1), the absolute value of the last bracket 
can not exceed 4, and hence 

I AV-i Or) | < -^-(1 — \x-\-iiX~ a 2 ) 172 cosy | n ~ x d ^. 


On repeating, with slight changes of notation, the reasoning 
which led to inequalities for the Legendre polynomials in 
Chapter I, it is found that 


JJ* |a;-H(l—a^l^cosy |*<Zy = 2 ^ [x iJ r(l—x?) cos*y] (n ~ 1)/2 rfy. 
a^+ (1 — a; 8 ) cos 8 y < e~* 9, P* 


for the values of the variables that come into consideration, 
if z —(2/?r)(l—ic 3 ) 172 , and 

X a (n i:^ X 

J '*O0 

_ S 

o 6 “ du ' 
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With the observation that n—1 > for n ^ 2, the pre¬ 
ceding inequalities may be summarized by saying that there 
is an absolute constant e such that 


Xn—l (#) — X n +i (x) | < c/n 11 ' 2 


for n 1 and for — 1 ^ x < 1; it is clear that the case 
n — 1 can be included in the final statement. (Strictly 
speaking, of course, the result is obtained directly for jrc| < 1, 
and extended to the ends of the interval by continuity.) 
The essential point is that there is no factor (1—a 2 ) 1 / 2 in 
the denominator on the right, as there was in the corres¬ 
ponding inequality for a single Legendre polynomial; its 
absence is due to the factor (1— x 2 ) 1/2 which comes out before 
the integral sign in the expression for the difference. 

Applied to the problem in hand, the inequality just obtained 
shows that 


\L 


Vx(a?) X n (x)dx 


< SPi (1) £ <■ cyi(l) 

= 2 m + l" n lr2 == 2 n sl2 


Similarly, the magnitude of the integral formed with <jp 2 (.0 
in place of cp x (x) does not exceed c$p 2 (l)/(2w </2 ), while for 
n > 1 the integral corresponding to the constant /(—1) is 
zero. Consequently, as (1) + SPs (1) == V, 


ff{x)X n (x)dx 


cV 

2 n 2/2 ‘ 


The analogue of Theorem III is 

Theorem Xiy. If f{x) is of limited variation for — 1 < x 
< 1 , its total variation being V, and if a n is the coefficient 
of X n (x) m its Legendre series , then, for «>0, 


| On | — 


JBoV 

n ll2 > 


where So an absolute constant, so that 


\a n X n (x) | 


< s 1 v_ 

— n 


for — 1 + // ^ x 1 —th the constant depending only 
on rj. 
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Now let it be assumed that fix) has a first derivative with 
limited variation, and let the total variation of f ip) for 
— 1 <£ x < 1 he denoted by V. (In more general terms, it 
would be sufficient that fix) be the integral of a function 
of limited variation, and not necessarily provided with a unique 
derivative at every point.) For the moment, let 

Tnix) = ^XniOdt. 

By integration by parts, 

J^/(jc) Xnix) dx = J^/C x) Tnix) dx 

= [/(*) Tnix)-\l_ x — \l\/ix) Ynix) dx. 

In the last member, r»(—1) = 0, and for n 1, F »( l ) = 0 
also. Furthermore, by the identity already employed, 

I» 0®) = 2n 1 +i - [Xn+i ix) - Xn~i ix )]; 

the terms coming from the lower limit of integration cancel 
each other, since X»+i(—1) = JT n -i(—1) = (—1)" +1 . Hence, 
with the assumption that n 1 , 

J^/0*;) X«(ar) dx = - - q- - §_ x fix) [X>-i ix)—X n+1 (x)] dx. 

li the integral on the right is taken as the difference of two 
integrals, Theorem XIV, or, more directly, the inequality 
immediately preceding the statement of that theorem, can be 
applied to each. It is to be noted once more that 1/in —1) 
< 2 In for «> 1. The resulting general formulation, however, 
does not hold for n = 1, as may be seen by taking fix) = Ox, 
in which case V = 0, while O may be arbitrarily large; the 
conclusion is essentially restricted to values of n 2. In¬ 
duction based on repetition of the process of integration by 
parts leads to a conclusion which may be stated as 
Corollary I. If fix) has ajoth derivative of limited variation, 
P ^ 0, and if V is the total variation of fW ix) for — 1 1, 

then, for n >j>, 
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| 


< j'g I 

= W P+(1/21 > 


uheare Up depends only on p, and 


| -Xn (*) | 


R PV V 
HP + 1 


for —— t], where S p7J depends on p and on y, 
but not on x or on n. 

Suppose fisc) is of limited variation for —1 < x < 1, with 
total variation V, and identically zero for a — jy < cc ^ /3 + jy. 
Let R (u ), as before, be equal to 1 /u for f ^ rj, and defined 
in some way for | u j < rj so as to have a continuous derivative 
everywhere. If M x is the maximum of {JB' 001» the total 
variation of R{u) over any interval of length 2 does not 
exceed 2 M X . The quotient f(t)l(x — t) is identical with 
fit) R{x — t), if x has any value belonging to the interval 
« <1 x ^ fi. It is of limited variation, regarded as a function 
of t, and its total variation has an upper bound which may 
be expressed as the product of V by a quantity depending only 
on y, and, in particular, independent of x. So Theorem XIV 
may be applied to the function /(i) R(x — t), and the result 
substituted in the integral expression for S n (x) , to prove 

Corollary II. Iff (a ;) is of limited variation for —1 < x ^ 1 , 
with total variation V. , and identically zero throughout an 
interval a — <^x < contained in (—1, 1), and if 

S n (x) is the partial sum of its Legendre set ies, then, for 
n > 1 and a < x < fi. 

C r Y 

I #*0*01 ^ “J—» 

where Cy depends only on 

Further consequences can be deduced substantially as in 
the case of Fourier series. Tif(x) has a bounded derivative 
for —1 a; <; 1, it comes under the hypothesis of the 
Corollary of Theorem XI in Chapter I, and its Legendre series 
converges to the value f(x) for — l<a?d. If the series 
converges uniformly for —l < x < 1, its sum agrees in value 
with fix) at the ends of the interval also, by continuity. 
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Ho Corollary I above yields information as to the order of 
magnitude of the error, under the hypotheses indicated. A result 
may be stated for the entire interval (—1, 1), as "well as for 
an interior interval, since | a n X n (x) | < \ an | for — 1 < x < 1: 

Theorem XY. If f(,x) has a pth derivative with limited 
variation, p > 1, if T’ is the total variation of :) for 
— 1 < x < 1, and if fi n ix) is the partial sum of the Legendre 
series for fix), then, for n >p, 

\f(x)—fi n u ); <; 

for —1 < x< 1, where Q' p depends only on j p, and 

for —l+jy < a: < 1— *i, where Q P y depends only on p and on y. 

In conclusion, the following consequences of Corollary II. 
taken first with the corollary of Theorem XI in Chapter I 
and then with the Theorem XY just formulated, may be 
stated without further comment: 

Theorem XYIa. If fix) is continuous with modulus of 
continuity <*>(d) throughout an interval a—g < x < ftv 
contained in (—1, 1), where to(d)>0 for d > 0, and of 
limited variation over the rest of the interval (—1, 1), then 

I fix) — S n ix) | ct»(2/n) log n 

for a<x <ft, ifn is large enough so that w(2 In) has a meaning, 
fin. ( x) being the partial sum of the Legendre senes for fix), 
and c a constant depending neither on x nor on n. 

Theorem XVIb. If fix) is of limited variation for — 1 
< sc ^ 1, and has a first derivative of limited variation 
throughout an interval «— ij < a? < ft -j- y contained in 
(—1, 1), then 

\fi x ) fin ix) | 

for « < x < ft r where S n (x) is the partial sum of the Legendre 
series for fix), and c is a constant depending neither on x nbr 
on n. 



CHAPTER III 

THE PRINCIPLE OP LEAST SQUARES 
AND ITS GENERALIZATIONS 


I. Convergence of trigonometric approximation as 
related to integral of square of error 
In the discussion of Fourier series hitherto, scarcely any 
mention has been made of one of their most remarkable and 


important properties, namely their relation to a problem of 
least squares. The property in question is not peculiar to 
Fourier series, but is of much wider significance. 

Let p 0 (a?). 2 h (x ), p s (as ), • • •, be a sequence of normalized 
orthogonal functions over an interval (a, h), that is, a sequence 
of functions satisfying the conditions 


f a Vj 0)p*.(a?) dx = 0 0 + X Lp*. (•'■.)]* tlar = 1. 


Let fix) be another function defined over the same interval. 
The functions p* ix) and fix) may be continuous, or, more 
generally, bounded and integrable in the sense of Riemann, 
or merely summable together with their squares. The problem 
of least squares in question is that of determining a set of 
coefficients Co, c n , for a given value of n, so that when 


SpO») = c-oPo (•«) + Cl Pi (x) + ... + CnPn (a") 


the integral 

J a t/O)— <?ix)Ydx, 


regarded as a function of its coefficients, shall be a minimum. 
The following theorem is well known: 

Theorem I. The integral has a minimum value, for the 
attainment of which it is necessary and sufficient that 

cj c — f a f(f)jpjc(x) dx, h = 0,1, 
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It is to be noticed at the outset that no function of the 
form 9 (sc) can vanish identically over the interval unless all 
the coefficients are zero; in other words, the conditions 
on the p 's insure that they are linearly independent. For 

f a [9(x)] a dx = cg-bcf#-+ <&> 

and the identical vanishing of 9 would require the vanishing 
of the sum of the squares. More definitely, it is recognized 
that the value of the integral on the left must be positive, 
if the c’s are not all zero. If the jp’s are continuous, this 
means that there must be one or more intervals throughout 
which 9 remains different from zero; if they are not so 
restricted, it means in any case that 9 must be different from 
zero over a set of positive measure. 

To prove the necessity of the condition, suppose that 9 (x) 
is a function of the form CkPk (x), in which, for a particular 
ndex k = m, 

Cm +X f(x)p m (x) dx. 

Let 

V (*) — Co Po ( x ) 4- Cj pi (x )H-Ke wi + h)p m (asH-f c n p n (sc) 

= 9&)+hpm(x), 

where h is a constant, the value of which will be specified 
presently. Then f(x) — tp(x) = f(x) — 9 (x) — hp m (x), and 

£ I/C 35 ) - ty(x)]*dx = J* a [/(a?) — 9 (x)] a dx 

— ^*) — 9P (»)]Prn (x) dx+h l [p m (x)\*dx. 

The last integral in this equation is equal to I; if the others 
are denoted by J, JR, 8 respectively, 

J — B — 2 hS+h*. 

But 9(x)p m (x)dx — Cm, and 

8 ~ £ [/(*) — 9(^)1 Pm (x) dx —j* a f(x)p m (x)dx — Cm#: 0, 
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by hypothesis. Let h be taken equal to S-, then 

J — B — 


and the value 22 corresponding to the function 9 is not the 
smallest value of the integral, since the function ty gives 
a smaller value. 

To show that the condition is sufficient, suppose now that 

9 = £ck pie with Cfc = I fix) pic(x) dx for k — 0, 1, • • n. 
and let 

lfj(x) = Co Po (x) + ci jpi (a?) + • *• -fc'nPnix), 


where at least one cL is different from the corresponding <1. 
Then 9 (x) — tyix) is a linear combination of po, Pn, with 
coefficients which are not all zero, and 


V — f [y> (x) — ip 0*)] 2 dx> 0 . 

e/& 

On the other hand, 

f»b nb 

J a f(x)pjc(x) dx = Cle = J a 9>(x)pk(x) dx, 

X L/(#) — 9 (a:)] !*(*) dx — 0 , k — 0 ,1 , 


so that if x&) — Pk (*) JS an y linear combination of 

Po,“',Pn, 

X [/(») “»(*)] *(*) dx 

= Ck X [/(*) “*(*)] Pk(ps) dx = 0. 

In particular, 

U —X tA 35 ) — Sp(*)] [?>(*) — V'( aj )] dx — 0. 

Hence, by the resolution f — ^ — (/— y) + (9 — V0> 

X*[/(as) — ^(a?)]* dx = X [/(*) — 9 Ob)]*<* a?+ SIH-F 

>X LA 33 ) — sp(*)!* **» 
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which means that y> actually gives the integral a smaller 
value than any other linear combination of p 0 . • • *, 

As the most important of specific cases, p 0 , lh> P> »• • • may 
be the sequence of functions 

1 cos# sin# cos 2a? sin 2# 

( 2 tt ) 1 / 2 ’ Tt 1 !- ’ TtW ’ n ’ tt 1 ' 2 ' *”■ 

If n above is replaced by n' — 2 m + 1 , g pipe) is a trigonometric 
sum T n (or) of the wth order. A periodic function/(cc) being 
given, the condition that the integral 

XI[ Ax)—T n (x)] a dx 

shall be a minimum, is that the coefficient of cos kx/rt 1 ' 2 be 



for k> 0, with a corresponding determination of the constant 
term and the sine terms, and this is equivalent to saying 
that T n (x) shall be the partial sum of the Fourier series 
for fix). 

This property of least squares may be taken as a point of 
departure for a discussion of the convergence of the series, 
under suitable hypotheses with regard to fix). While the 
hypotheses are more restrictive than those used in the con¬ 
vergence proofs already given, the method is of interest for 
its own sake, and furthermore lends itself to a remarkable 
variety of generalizations in other directions. It depends on 
theorems of approximation from Chapters I and II, and also 
on a proposition, due in the first instance to S. Bernstein, 
the simplicity and importance of which are noteworthy in 
the highest degree: 

Bernstein’s Theorem. If T n (x) t$ a trigonometric sum of 
the nth order, and if L is the maximum of its absolute value, 
the absolute value of the derivative Tnix) can not exceed hL. 

The proof to be given here was devised independently by 
Marcel Riesz and by de la Vallde Poussin. For the purposes 
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of the demonstration, the theorem may be restated in the 
following equivalent form: 

If Tn Or) >s a trigonometric mm of the nth order, and if 
the. maximum of | Th Or)| is 1, the maximum of'T n (x)[ itself 
can not be less than 1 hi. 

Suppose the maximum of \ T„(or) | were less than l/«. 
Then, for any value of the constant C, the function 

R n Or) = -- cos (nx — C) — T n (x) 
n 

would have the same sign as cos (nx — C) at each of the 
points C/n, (C-\-n)/n, (C-\- 2 n)/n, —, (Chi)-\- 27 t. where 
cos {nx — C ) takes on the values +1 and —1 alternately. 
Hence i?» (x) would vanish at least once in each of the 
2 n intervals between successive points of this set. say at 
the points aj lf a? 2 . * • •, .r 2 », all contained in an interval of 
length 2n. By Rolle’s theorem, Rh(x) would vanish between 
and ,r 2 , between a 2 and a- 3 , etc., and also, as S»(a*x + 27r) 
— Rn(xi) — 0, between xz n and x l - s r2n; and the 2 n distinct 
roots of JRA thus specified all lie within the interval from 
to x t -\~2 n. Explicitly, 

Rn(x) — —sin {nx — C ) — Tn(x). 

If C is chosen so as to make sin (nx — C) equal to the 
negative of Tn (x) at a point where Th (x) = dh 1 > Rh (a:) will 
have a double root at this point; for each of the functions 
sin (nx — C), Th(x) separately attains a maximum or a minimum 
there. Having 2 n distinct roots in a period, and at least 
one double root, Rh has roots of aggregate multiplicity at 
least 2w+l. But this would require that Rh, as a trigonometric 
sum of order n, vanish identically, which is impossible, since 
Rn is sometimes positive and sometimes negative, by hypothesis, 
and so can not be constant. Consequently the assumption 
that 1 T n {x) | remains less than 1/n is inadmissible. 

(The fact that a trigonometric sum of order n can not 
have roots of multiplicities aggregating more than 2 n without 
vanishing identically is assumed here as known. A special 


6 
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case of it, sufficient for the present application, is proved 
incidentally in another connection in Chapter TV below.) 

To proceed with the proof of convergence, let /(») be 
a continuous function of period 2 jt , and S n {x) the partial 
sum of its Fourier series to terms of order n, and let 

*-£[/«-aw*.. 

Let t n (x) be a trigonometric sum of the nth order, sub¬ 
sequently to be chosen so as to furnish an approximation 
to /(«), but arbitrary for the present, and let 

f(x )— t n (x) — r(x), S n (x) — tn(x) = r n (x). 

Then r —t„ is identical with f —8 n , and 

^Jr(x) — T n (x)Ydx = Yn. 

Let e n be an upper bound for the absolute value of 
k(»)i = !/(«)— tn(x)\ 

for all values of x. Finally, let /■»» be the mfl-irimnnn of 
ka( 55 )|, and xa a point where | r n (xo) | — n n . 

By Bernstein’s theorem. 

Wn{x)[<niJU n 

for all values of x. For \x — xo | < l/(2n), consequently, 

I*■»(*)— ^(ico) j < j T n (x) I > 

If (the contrary case being reserved for separate 

mention), 

throughout the interval specified. As the length of the 
interval is 1 In , 
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whence p n < 4 (tiy n ) 1/ ' 2 . To suppose, on the other hand, that 
*n !/«», is to assume outright that i*n<. 4«». So in either 
case /*» does not exceed the larger of the quantities 4 s„. 
4(ny n ) V2 , or, in a single formula, 

(*n < 4(»r„) 1/2 H- 4t n . 

But | ]</*„, |/'|<*», and >• — r n is the same as f — S„. 

Consequently 

1/0*0— 8 m (jj) . < 4 (n y n ) v ' 2 + 5#„ 
for all values of x. 

The function fix) being continuous, it is always possible 
to choose tnix) for successive values of n in such a way 
that limn^ao e n = 0. So the relation just obtained amounts 
to a proof of 

Theorem II. If fix) is a continuous function of period 2n. 
its Fourier series will converge uniformly to the value fix), 
provided that 

lim iiYn — 0. 

n=oo 

In particular, Theorem VI of Chapter II yields at once 

Corollary I, A sufficient condition for uniform convergence 
is that fix) he absolutely continuous. 

By Theorem II of Chapter I, on the other hand, if at id) 
is the modulus of continuity of fix) , there will exist sums 
tnix) such that 

\fix) — tn ix) I <K'(o (2tt/w), 
where K' is an absolute constant. Then 

J ^j.f (x) — tn (a0] s dx < 2 n [K'ta (2 tc! w)]*. 

But as the integral takes on its minimum value when S n ix) 
is put in place of tnix), the right-hand member of the last 
relation is an upper bound for y n - So ny n will approach 
zero if n ll2 mi2n/n) approaches zero, and it is possible to state 
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Corollary II. A sufficient condition for uniform convergence 
vs that lim^o <o (d)/d l t 2 = 0, wh&'e a>(6) is the modulus of 
continuity of f(x). 

The conditions of Corollaries I and II of course overlap 
to a considerable extent, but they are not coextensive, and 
neither includes the other. As previously mentioned, they 
are less general than others which have already been obtained, 
absolute continuity is more stringent than limited variation, 
and the requirement involving 6 m is more restrictive than 
the Lipschitz-Dini condition. 

The novel possibilities of the present method will be more 
apparent, if it is remarked that in the proof of Theorem n, 
down through the final relation of inequality preceding the 
statement of the theorem, no use whatever was made of the 
assumption that S n (x) was the Fourier sum for fix), and 
the value of the integral y» a minimum; the argument can 
be repeated step by step with 8 n (x) replaced by T n (x), an 
arbitrary trigonometric sum of the nth order, and y n replaced 
by g n , the integral of [/(sc) — T n (sc)] 2 over a period. The 
conclusion is 

Theorem II a. If fix) is a continuous function of period 
2 tv, T n (x) an arbitrary trigonometric sum of the nth order, 
and 

gn =£ [/(«)- TnixWdx, 

and if there exists a trigonometric sum tn (x), of the nth order , 
such that 

\f{x) — t n (a?) I < e n 

for all values of x, then, for all, values of p, 

|/(s) — T n (x)\ < 4 (ng>f 12 + 5 

To this may be added immediately 

Theorem Hb. If sums Tn (sc) are defined for, infinitely 
many values of n in 'such, a way that g n Ay n , where A' is 
independent of n, the sums T n (x) will converge wniformlyrto 
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the value fix ), as n becomes infinite, under the conditions of 
Theorem II or either of its corollai les. 

The next application depends for its simplest expression on 
an existence theorem, the proof of which will be postponed 
for a few pages, to avoid interruption of the current order 
of ideas. The statement of the theorem is as follows: 

Theorem III. If q(x) is a summable function of period 2 n 
winch is nowhere negative, and is different from zero over 
a point set of positive measure in a period, if fix) is a sum- 
mdble function of period 2 tt such that (KaOL/Xa*)] 3 is also 
summable, and if Ti (x) is a trigonometric sum of specified 
order n, the integral 

[/O) — T n 

has a minimum, which ts attained for one and just one deter¬ 
mination of the coefficients m T,iix). 

The truth of this assertion being assumed for the moment, 
attention will be directed to the question of the convergence 
of Tnix) toward fix) as n becomes infinite, the function fix) 
being supposed continuous. The problem is connected with 
an extensive theory developed from a different point of view’ 
by Stekloff, SzegO, and others. The minimizing sum may be 
called the approximating sum of m'dei n coi responding to the 
weight function q Or), and it will be understood now that 
T n (x) denotes this particular sum. The corresponding minimum 
value of the integral will be denoted by G n . For the con¬ 
vergence proof, let q(x) be further restricted by the hypo¬ 
thesis that its values are comprised between two positive 
bounds: 

OO ^ $(as) ^ V, 

yrtiere v and V are independent of x. 

By the minimizing property of T» (a 1 ), 

Gn — q (as) [/(a?) — T n (as)] a d x 

^ JL. Q (*) [/(as) — Sn ix)]*dx < Vy n , 
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where S n (x) and y n have their previous meanings. On the 
other hand, with the notation of Theorem Ha, 

On > v j_ r [f(oc)— T n (x)Y dx = vg n . 

So ffn £ (V/v)y n , and the conclusions of Theorem II b are 
applicable: 

Theorem IV. If the weight function q(x) has a finite upper 
hound and a positive lower hound, the approximating sum T n (x) 
will converge uniformly toirard fix) as n becomes infinite, if 
lim w =o6»y» = 0, anil hence, in particular, if fix) is absolutely 
continuous, or has a modulus of continuity <»(<?) such that 
lim^>=0 «(dydV 2 = 0. 

In modification of this result, let the restriction |(>(a:)| < V 
be removed, the hypothesis with regard to q being merely 
that it is summable, and that q > v > 0 everywhere. Since O n 
is a minimum, 

On < [fix) — tn ix)] s dx 

if tnix) is any trigonometric sum of the nth order. Let e„ 
be an upper bound of \f —1 »\, as before. Then 

O n < dx = lei, 

with the notation J*g = I. As it is still true that O n ^ vg n , 
it follows that g n < (J/u)^. Under the hypothesis that 
linv_ 0 »(dj/d 1 / 2 = 0, tnix) can be chosen so that lim«=oo n lf2 e n 
— 0, which means that lim n =oo = 0, and Theorem Ila 
shows that T n (a?) converges uniformly to fix) once more. 

a. Convergence of trigonometric approximation as 
related to integral of mth power of error 

To generalize in another direction, consider the integral 
Q 0*0 1 / 0*0 — Tn ix) \ m dx, 

in which m is an arbitrary positive number, taking the place 
of the particular exponent m — 2 that has been used hitherto. 



III. PRINCIPLE OP LEAST SQUARES 


87 


When m, n, q(x), and /(a;) are given, fix) being bonnded 
and measurable, it can be shown that there exists a deter¬ 
mination of the coefficients in T n (sc) for which the value of 
the integral is a minimum, provided that q(t) is summable. 
everywhere positive or zero, and different from zero over 
a point set of positive measure in a period. When m > 1, 
the determination is unique. The existing presentations do 
not give the theorem with quite this degree of generality, 
but it is not difficult to supply the necessary extensions. Eor 
somewhat less general functions q (in particular, for the case 
that q = 1 identically) it has been shown that the deter¬ 
mination is unique for m = 1 also, if fix) is continuous. 
It is not generally unique when m <. 1. The proofs will be 
omitted here, the facts with regard to convergence being 
formulated in such a way as not to presuppose a knowledge 
of the theorems of existence and uniqueness. 

For the discussion of convergence, let fix) be a continuous 
function of period 2 n, and let it be supposed that there is 
a number v such that 

q ix) > v > 0 

for all values of a?, while q (as), as always, is summable over 
a period. Let 

9n = G») |/(®)~ T n ix) \ m dx , 

T n ix) being for the present an arbitrary trigonometric sum 
of the nth order. (The notation is changed somewhat from 
that previously used in treating the case m = 2.) Let it be 
assumed that there is a sum tn ix), likewise of the nth order, 
such that 

| fix) — t n ix) | < e» 
for all values of x, and let 

fix) — tn ix) = r ix) , Tn ix) — tn ix) — T n (a;) , 
whence 

j^ K Q 0*0 I r ix) — T n ix) | m dx — g n . 
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If p, lt = j C/l (x 0 ) | is the maximum of j r n (as) |, it follows that 
i Or ) | < by Bernstein’s theorem once more, and there 
is an interval of length at least l/n throughout which 

! (as) — t,i (as 0 ) | < ! v n (x) | 2^ -l*n, 

i r (as) — ?» (as) | > 
provided that e> 2 < J . Then 

<7»> -f -(■^■)* ^<4 (ngjvyi™. 

Whether «« < i or not, 

/«» < 4 {ngJvY !ni + 4 , 

and 

|/(a;)—(as) | = j >-(as) — t„U) | < 4,{ng n /vy lm -\- 5«». 

For each value of n, let be the greatest lower bound 
of the integral g n , when all possible values are given to the 
coefficients in T n (as) , while the functions $ and f and the 
exponent m are held fast. It is clear that y n > 0. Nothing 
is assumed as to the possibility of making g n actually equal 
to y n . by one or more determinations of the coefficients. 
As g n can not be less than y n for any sum of the nth order, 
it follows in particular, sinee 4 (as) is such a sum, that 

Yn < (a?) | /(as) — 4 (as) \ m dx < Ie™, 

the value of JV being denoted by I. If sums T n (x) are 
chosen for the successive values of n so that g n < Ay n , where 
J. is independent of n, then gjl m < (AT) llm e n , and 

|/(a?) — T n (as) | < Bn 1/m e n , 

where B is likewise independent of n . The sums T n (as) will 
converge uniformly toward /(as), if n llm e n can be made to 
approach zero, and this will be possible under conditions 
indicated by Theorem II of Chapter I, if m> 1; by the 
Corollary of Theorem IV of that chapter, if m = 1, and by 
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Theorem IV itself, if 0<«i<l. The principal results may 
be summarized in the following formal statement: 

Theorem V. If the weight function q(x) is summable and 
has a positive lower bound, and if a sum T n (x) of the nth 
order is chosen foi each of an infinite set of values of n so 
that g n ^ Ay n , where A is independent of n, these sums will 
converge uniformly toward figs), token m is held fast andn is 
allowed to become infinite, if m > 1 and fix) has a modulus 
of continuity <o{d) suck that lim 4 f_ 0 ®(<J)/<J 1 ' , ” i = 0, or if 
m— 1 and fix) has a continuous derivative. 

The explicit statement for m < 1 is more complicated and 
less interesting. In the cases covered by the theorem as 
stated, for m > 1, there is in fact a determinate approximating 
sum for each value of n, making g n — y n , and the conclusions 
of course apply in particular to the convergence of these 
approximating sums. On the other hand, it is clear that A 
can be replaced by a factor A n which becomes infinite with n, 
if more restrictive hypotheses are placed on fix), so that 
can be made to approach zero with sufficiently increased 
rapidity. 


3. Proof of an existence theorem 

To carry out an intention expressed above, it remains to 
supply a proof of Theorem III, with regard to the existence 
of an approximating sum corresponding to a given weight 
function q (x) when m — 2. 

It will be worth while to begin with more general con¬ 
siderations. Let qo(x), qt(x), <&(■»),•••, be a sequence of 
functions, each summable together with its square over an 
interval (a, b). Let these functions be linearly independent, 
in the sense that any linear combination of a finite number 
of them, with coefficients not all zero, is different from zero 
over a point set of positive measure in (a, 6), which is equivalent 
to saying that the integral of the square of such a linear 
combination, extended from a to b, is positive. The requirement 
may be indicated briefly by saying that the g’s are properly 
independent Let 
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I Qo 

*“* >■ 


JftQo 

‘ Qo /a ? 

/« 

I £s Qi 

-a , 

/« 


and generally 


*—1 

Qft — Qk Q/ 


/juft 

W' 


it is understood that each integration is to he performed 
with regard to x, from a to l. Each denominator is different 
from zero, because of the hypothesis of proper independence. 
It is seen from the definition of Qi that J Q 0 Q L = 0. Since this 

is the case, the definition of Q a makes J* Q 0 Qs — § Qi Qi — 0. 
Generally, each Q is orthogonal to all the preceding Q’s, or, 
in other words, any two of the Q’s are orthogonal to each 
other. If 

Pk(x) = Qk/ (J* q£) , 

the functions pk form a normalized orthogonal sequence, to 
which Theorem I is applicable. Each pk is a linear combina¬ 
tion of q 0 , •••, qk, and conversely each qk can be linearly 
expressed in terms of pk and the p’s of lower order, since 
the coefficient of qk in the expression for pk is not zero. 

Now let q(x) be a non-negative summable function of 
period 2 re, such that 

f* K Q(x)dx> 0, 

and let fe(as)] 1/a = w(x). The functions 


The functions 


w(x), w(x)cosx, w(x) sin x, w(x) cos 2 x, w(x) sin 2 x, ■ - • 

are summable, with their squares, and properly independent; 
a linear combination of a finite number of them, with co- 
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efficients not all zero, is a trigonometric sum multiplied by 
•w(x), and is different from zero wherever e + 0. with the 
exception of a finite number of points at most in a period. 
If they are taken as the functions qo, gj, q&. • ••, of the 
preceding paragraph, over the interval (— n, jt), the corre¬ 
sponding functions p 0 ,Pi,Ps< • have the form 

ir(x)U 0 (x), w(x)U 1 (x). n(c/‘)Vi(x), ic(sr) Z7 s (.e), 

ic(x) F a (.r), • • - . 


where Ul and T 7 ?.- are trigonometric sums of the A-th order, 
and 


£*(*) Uj(x) U k (x) d :r = dx = 0 (j * k\ 

(*7Z 

Uj(x) Ta( a,) dx — 0 for ally and Ar, 
( x ) [Uk(x)] a dx — Q(x) [FfcC#)] 1 dx — 1. 


If T n (x) is any trigonometric sum of the wth order. u-(x) T n (x) 
is a linear combination of the functions now taken as the q’s. 
and so is a linear combination of the functions to(x) Uk(x), 
w(x)Vjc(x), k = 0 , 1 , •••,«, and T n itself is expressible in 
terms of Uk, Yk, with the same coefficients. 

Let f(x) he a summable function of period 2 nr, such that 
q(x) [/(cr)] 8 is also summable. Then 

LA*)— T n {x)Y dx — (*)/(*)— w(x) T n (x)] a dx, 

and Theorem I can be applied, with w(x)f(x) as the function 
for which an approximation is sought. The integral has 
a minimum value, and if 

Tu (ar) = Ao Ui o (ar) + Ax U 1 (x)+ -h A n TJ n (x) 

+ B\ (x) -f- * • • "t - Bn Yn (ar), 


it is necessary and sufficient for the attainment of the minimum 
that 
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Ak = ir(x)f(x) • w(x) Uh(x) dx =J^ K e(x)f(x) U k (x) dx, 
Bk = (pa) Vk(x) dx. 


4. Polynomial approximation 

Attention will next "be directed to problems of polynomial 
approximation. An essential instrument here is the form of 
Bernstein's theorem in which Bernstein himself was primarily 
interested: 

Bernstein’s Theorem for Polynomials. If P» (x) is 
a polynomial of the nth degree, and L the maximum, of 
! Pn(x) | foi —1 ^ x ^ 1 , then 

\Pn(x)\ < a 

for —l<a“<l. 

It can he deduced immediately from the theorem for the 
trigonometric case. Let x — cos 9. Then P n (x) = JP n (cos 6) 
is a trigonometric sum of the nth order in 0, having L for 
the maximum of its absolute value, and consequently 


A. 

d9 


P n (cos 0) 


Isin0P,Ucos0)| = |(1— x 3 ) 113 Pn(x )| <; nL, 


whence I P» (x) | < nLJ(l — of) 11 ' 1 . 

If L is the maximum of \P n (x) | for a <x < b, let 

2x—a — b 

V - - T- a 

Then P n (x) — Qn(y) is a polynomial of the nth degree in y, 
having L for the maximum of its absolute value in the 
interval —1 < y < 1. Therefore 


dy 


Qn(y) 




2 |-P»(*®)I 5 - ( 1 _^2Jl/2 


by substitution of the value of y in terms of a; in the factors 
(1— V)> (1+2/) under the radical sign, one is led to 
Corollary I. If P n (x) is a polynomial of the nth degree, 
and L the maximum of \P n (x) | for a<x<b, then 
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|JSW l s 1®=3^=5SF 

for a<.x<.b. 

This implies directly 

Corollary II. If P„(x) ts a polynomial of the nth degree, 
and L the maximum of |P n (a*)| /or a < r < h. and if 
a<. a i <ib 1 <h, then, 

\Pn(x) \ < hnL 

for ai<_x< bi, where 7i depends only on a, a l} h x , and h. 

A further corollary is to be obtained by the following 
considerations. Let P n (x) once more be a polynomial of 
the wth degree, and L the maximum of its absolute value 
for — 1 < x < 1 . Let x be a number of the interval 

— 1 < x < 1, and <5 a positive number < 1 such that 

— l<x— 6 < 1 . Then 

| P n (x) -P n (x - 3 )! = | £_^P'(t)dt j < . 

It is readily seen that • 

r* dt ^ r 1 at 

j x -* (i - - p) 1 - * 


(For an analytical proof, let 6 — 2 ?, x —= y, so that 
x —6 = y y , x = y-\-y, 


Since 


f“ dt 

X-d (x-n* 


F** dt 

y-r, (1 -#) 11 *' 


d r + ’ dt _ i i 

dy “ [l-(2/+?) 2 F 2 [1 — (y — ?) 3 ] 1/a * 


which is equal to 0 if y = 0, positive if y> 0 , negative if 
y< 0 , the value of the integral is a minimum when y — 0, 
that is, when the interval of integration is symmetric about 
the origin, and increases steadily as the interval is displaced 
toward either side.) As 8 < 1, 0 < t < 1 in the integral 
from 1 —6 to 1, 1—f*>l—0, and 
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So 


f 1 dt 

Jx-cf (l-^ 2 


< 


r 1 dt 

Ji-tf a -«) w 


= 2d 1 ' 2 . 


! P« (O') — Pn(a—d)| < 2 nL d 1 ' 2 . 


An equivalent statement is that if x x and x a are two numbers 
in (—1, 1), differing by not more than unity, 


!P»(* s )— 1 < 2 nL\x a — a r. 


Now suppose that L is the maximum of | P n (x) , for 
a<x<b, and let as* and x s be two numbers of this interval, dif¬ 
fering by not more than (b — a). Let y — (2x — a — b)/(b — a), 
the values of y corresponding to .r L and x a being y x and y ± , 
and let P n (x) — Qn (y) . Then L is the maximum of \Qn(y) • 
for — 1 < y < 1, y x and y a are two numbers of this interval, 
differing by not more than unity, and 

i P n (x 2 ) —P»(a?i) | == i Qn(y 2 ) — Qn (yi) i < 2 nL | y 2 —yi | 1/2 
— 2nL\2\x 2 — xi\/(b —«)] 1/2 . 

The conclusion is 

Corollary HI. If P n (x) is a polynomial of the nth degree, 
and L the maximum of | P n (sc) | for a < x < b, and if x x 
and Xt are two numbers in the interval, differing by not more 
than — a), then 

IP»0%) —Pnfcri) \<HnL\x*—x x p» 
where H = 2/[i(6—a)] 1/a . 

By the introduction of an intermediate value x 9 — % (xi -f sc*) , 
it may be seen that 

ip. w-p.(®oi < i r, 

without the restriction that \x s —sex \ <\(b — a). 

The way has now been prepared for a discussion of 
problems of approximation associated with the integral 

X eG») 1/0*) —Pn (a) | m dx, 
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where q(x) and f(x) are given functions in the interval 
(a, b), P n (x) is a polynomial of the nth degree, and m is 
a given exponent >0. 

When m — 2 , a theorem of existence and uniqueness may 
be stated as follows: 

Theorem VI. If q(x) is a summable function which is 
nowhere negative, and is different from zero over a point set 
of positive measure in the interval (a, b), if f (or) is a summable 
function such that ? (if) [f(oc)Y * s &lso summable from a fob. 
and if P n (x) is a polynomial of specified degree n, the integral 

X Q&) [f(x)—Pn(.x)]*da‘ 

has a minimum, which is attained for one. and just one de¬ 
termination of the coefficients in P n (pc). 

The proof is entirely analogous to that of Theorem III. 
The functions 

■w(x), xw(x), of w(x), •••, 


where w(x) — [e(a;)] 1/2 , are summable together with their 
squares and properly independent over the interval (a, b). 
If they are taken as the functions qifx) previously considered, 
the corresponding functions pk(x) have the form io(x) Yk(x), 
where Yk(x) is a polynomial of the fcth degree, and 


f q(x) Yj(x) Yjc(x) dx — 0 
$(x) [Ffc(a;)] 2 <;£C = 1. 


O' + *), 


The polynomials Yk(x) are called the Tchebychef polynomials 
corresponding to the characteristic fwnction q(x) in the inter¬ 
val (a, b). Any polynomial P n (x), of the wth degree, can be 
expressed as a linear combination of F 0 , Fi, • • •, Y n , and 
for the minimising of the integral in the statement of the 
theorem it is necessary and sufficient that the coefficient of 
Yk in this expression be 

f a Q&)f(fi) Yk(x) dx. 
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In the case that q is identically equal to 1 and the interval 
is that from —1 to +1, Tk(pc) is equal to (k -j- %) 112 Xic(x), 
where Xu is the Legendre polynomial of the fcth degree; and 
the approximating polynomial P n (x) then is the partial sum 
of the Legendre series for f{pc). 

For an arbitrary value of m "> 0 , theorems of existence 
and uniqueness can again be established in analogy with those 
of the trigonometric case. An approximating polynomial, 
reducing the integral to a minimum, exists if f(x) is bounded 
and measurable, and if q(x) is summable, nowhere negative, 
and positive over a point set of positive measure in (a, 6). 
The approximating polynomial is unique when m > 1, and, 
with suitably restricted hypotheses, when m = 1 also. The 
proofs will not be given here, the further discussion being 
put in such a form that a knowledge of them is not essential. 

Let fix') be a continuous function for a <. x <. b, and P n (x) 
an arbitrary polynomial of the nth degree. Let q(x) be 
summable over (a, 6), and nowhere negative in the interval, 
and let q (x) v > 0 for a 0 ^ x < jS 0 > where a a 0 < <b. 

(It will be noticed, as a departure from the conditions of 
the earlier convergence proofs, that the hypothesis q v7>0 
does not necessarily hold over the entire range of variation 
of x.) Let 

9* ~ fa e ^ 1/6*0 ~ AC*) \ m dx. 

Furthermore, let p n (x) be a polynomial of the nth degree, 
in general distinct from P n (x), 

\f(x)—p n (x)\ < e n 

for a < x ^ 6, and 

f(x)—p n (x) = r(cc) t P n (x) — p n (x) — 7 t n (x), 
so that • 

X 9 l r G 33 ) ~ **»(*) I™ dx = 9n- 

Suppose that fi n is the maximum of | n n (a:) | for « 0 x ftoi 
and that is a point in (a 0 , fl 0 ") at which this maximum is 
attained. By Corollary HI above, 
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1 7 t» (x )— TTjiOo)! ^ Hrtfin r — .r„ 1 ' 2 

as long as a? is in (a 0 , fi 0 ) and distant from ./•„ by not more than 
i(A—“o)j the factor H being equal to 2 /[^ (/? 0 —« 0 )] 1/2 . These 
conditions "will be satisfied for n ^ 1 throughout one or the 
other (or both) of the intervals (x 0 — d, x 0 ), (,r 0 , x a + d). if 

t _ 1 _ A «o 

4I? 2 w s — 32 m 3 • 

Over such an interval, then, since \x — x 0 1 1 '- < 1/(2 Rn). 

Wn (x) — 7t n (cco) I < ~ t*»> I Tr “ (f)! = Y /*« • 

If % < if*nj 

I r(se) — 7 T n (x )| > 

over the same range; and, as the length of the interval is 
l/(45*»*), 

_ > V (PA™ 

9n ^ 4i? a w s \ 4 j ’ 

< 4(4jS r */v) 1/m (n a ff n ) 1,m = Bo («W /m > 
with = 4(4J9 ra /w) 1/TO . Whether «» < £/*« or not. 

Pn ^ 4 £n., 

]/(ar) — B n (aj) | = | r(x) — n n (x) | < B 0 (n*g n ) llm + 5 e n 
throughout (« 0 > A>)- 

Let Yn be the greatest lower bound of g n , when q(x),f(x). 
and m. are given. As an immediate consequence of the 
definition, 

Yn Q&) \f(x) — p n (x)\ m dx <, Ie%, I = f a q(x)dx. 

If polynomials P n (sc) are chosen for an infinite succession of 
values of n so that g n ^ Ay n , where A is independent of m, 
then g)l m <; (AI) lfm e n , and 

\f(x) — P n (x) | <1 Bn 2 lm e H , 
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i)8 

for «„ ^ x < fio, with a factor B which is likewise inde¬ 
pendent of n. 

The conditions to he imposed on fix), in order that it may 
be possible to make n? !m e n approach zero, are to he obtained 
from Theorems VI and YJH of Chapter I, and the Corollary 
of the latter. It will be sufficient to state the results form¬ 
ally for m > 2, though corresponding statements could readily 
be added for 0<m<C2: 

Theorem VTL If q(x) is summahle over (a, b), and has 
a positive lower bound for a 0 < x < fo, where a<L «« <./£<> 
^ by and if a polynomial B n ( x ) is chosen for each of an 
infinite set of values ofn so that g n ^ Ay H , where A is inde¬ 
pendent of n, these polynomials will converge uniformly toward 
fix) for ct 0 <, x <, JH 0) when m is held fast and n is allowed 
to become infinite, if m > 2 and fix) has a modulus of con¬ 
tinuity to(d) such that lim^o co(d)/d 2 l m = 0, or if m — 2 
and fix) has a continuous derivative. 

Under the conditions formulated, there will in fact be one 
and just one polynomial of each degree n for which g n — Yn, 
not merely for m — 2 , in accordance with Theorem VI, but 
also for an arbitrary m> 2 , under a theorem cited above 
without proof. Theorem VII naturally applies to the con¬ 
vergence of these approximating polynomials. More particular¬ 
ly still, for q{x) — 1 , a = — 1 , b — 1 , one may state 

Corollary I. If fix) has a continuous derivative for 
— 1 x ^ 1, its Legendre series converges uniformly toward 
fix) throughout the closed interval. 

Another part of the content of the theorem, aside from the 
question of uniformity of convergence, is expressed in 

Corollary H. If q (x) is continuous and nowhere negative 
for a x b, the hypotheses of Theorem VII remaining 
otherwise unchanged, the polynomials JP n ix ) will converge toward 
fix) at all points where q(x) 4 1 0. 

Additional information can be gained by further elaboration 
of the proof of the theorem. Let the previous notation be 
continued in force, with the understanding' now that poly¬ 
nomials JPn (a:) and Pn(x) are defined for all positive integral 
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values of n, and that these sequences of polynomials are 
kept unchanged throughout the course of the reasoning. Let 
« and A be two numbers such that a 0 <. a <C/3 </l 0 . Let 
«i, « a » ••• and Ai, As, ••• be two infinite successions of 
numbers such that 

®o ■'C < C * * * ® ^ <1! • * • A 2 A A) • 

Let be the maximum of |tt m (;k)| for « < r < A. and 
the maximum of the same function for a* 5 < or < A*; f*»o is 
the same as the previous /» n . It follows immediately from 
the definitions that 

^ /*«2 ^ = J 'n* 

Until the contrary case is explicitly mentioned, it will be 
assumed that Then, a fortiori, f*,#, and 

f&no < Bo gn) llm , by the earlier work. If < ^L^n, as will 
be assumed from now on, g)[ m does not exceed a constant 
multiple of e n , and 

' Mio < A 0 n 2 l m e n , 

where Ao is independent of n. 

Suppose it is known, for a specified value of ft, that 

fettle ~ Afc ri a £» , 

A* being independent of n. The exponent c will be denoted 
by <fk when there is occasion to emphasize its dependence 
on ft. By Corollary II of Bernstein’s theorem, 

! (a?) ( < hnf*nk 

for a*4-i^£C^Afc+i> the factor ft depending on a*, Aft, «*+i, and 
Ah-i, but not on anything else. Let x x be a point in (a*.fi, A&+i) 
such that | -rt n (pot) [ = /* n ,k-{-i, and let A = /*n,fc+-i/(2 ft w/*«*). 
Because of the fact that ^ g>nk, A 1/(2 hri), and it 

is certain that at least one of the intervals (*1 — d lt ajt), 
(xx, A) consists entirely of points belonging to Ah-i), 
as soon as n is sufficiently large. This condition being satisfied, 


7* 
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fin (re) — (re,; t*n. **-i < 1 n n G») I ^ /*«, *+i ? 


throughout the interval designated, and as it has been supposed 
that f» 

.. 1 ^ .* /^n,M-x\ w v t*Zk+i 

r(.r)—fi»(x), ^ 4 .0* ^ ®4i (— 4 —/ ~ 4^2W^-' 


where Bk+i is independent of n. Taken with the relations 
ffn^AIe%, A k n a e n , this means that 

J“ W( *+x < 

or, if 5fc + i(J. J.fcl) 1/<B1+1) is denoted by A*+1, 

/««,*+! ^ n^'^+D « B . 


Because of a qualification introduced at one stage in the 
work, the result has been obtained in the first instance only 
for values of n from a certain point on; but it can be extended 
to the finite number of values of n previously neglected, from 
n — 1 on, by • a suitable adjustment of the value of A*+i. 
It is recognized therefore that the successive numbers fink have 
•upper bounds involving a sequence of exponents o'*, beginning 
with <r 0 = 2/m, and so related that o*+i= (o)rf-l)/(m+l)» 
It may readily be verified by induction that 

„ _ 1 | 1 
k m m(m-f-l) fc 

If ij is any positive number, it will be possible to choose 
a value of k for which <%< (1/m) + tj . With such a value 
of k, 

J 'n<, t*hk^ A lt n (XIm)+1 > e n . 

All this, it may now be recalled, is on the assumption that 
«»< i v n . But in any case 

4 e n -\-A k r^ m ^s n . 

The last relation, by the definitions of v n and e n , implies 
that 
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| / (as) — P n {x) \ — \ r ix) — n n (») j <5 A* « (1 m)+Ti *» 
for « ^ a? ^ ft. More concisely, since 1 n a/m:)+7! for n 1. 

| f(x) — Pn (/) I ^ Cn alm) +9 , 

if C r = -4.* + 5, the factor C, like those which have preceded 
it in similar situations, being independent of n. The previous 
results with regard to convergence may be supplemented by 
stating- 

Theorem VIII. If q (x) and P n (as) satisfy the hypotheses 
of Theorem VII, and if a 0 < « < fi< fio > the polynomials P n (ac) 
will converge uniformly toward fix) for a <1 x fi, if m> 1 . 
and if a positive number q exists such that 

lim to (d)/«J Wm)+5 ? = 0 , 

<J=o 

where to (d) is the modulus of continuity of fix). 

Corollary. If q ix) is continuous and nowhere negative for 
a<Lx-<b, the hypotheses with regard to P n ix) and fix) 
remaining unchanged, the polynomials Pnix) will converge 
toward fix) at all points where q (x) 4 0 . 

For points in the interior of (a, b), this corollary super¬ 
sedes the second corollary of Theorem VII, being a direct 
generalization of it. 

There are analogous but less simple conclusions for 
0 <m 1 . 

Generalizations of Bernstein’s theorem (in appropriately 
modified form), leading to the extension of parts of the pre¬ 
ceding analysis to certain cases of development in terms of 
characteristic functions of differential systems, have been 
given by Miss Carlson (Transactions of the American Mathe¬ 
matical Society, vol.26 (1924), pp. 230-240, and vol. 28 (1926), 
pp. 435-447). 

5 . Polynomial approximation over an infinite 

interval 

Another extension of the scope of the method lies in its 
application to problems of polynomial approximation over an 
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infinite interval. To be specific, let fix) be a function defined 
and continuous for all real values of x, and not greater in 
absolute value than a constant multiple of a power of x, as 
x becomes infinite in either direction. Among all polynomials 
of specified degree n, let Pnix) be the one which minimizes 
the integral 

r e-* 2 [f(x)—Pn{x)Ydx. 

e/—oo 

The proof of the existence of the minimum and the uniqueness 
of the corresponding polynomial offers no new difficulties in 
comparison with cases already treated, and will be omitted. (It 
can be based either on the construction of a sequence of orthog¬ 
onal polynomials over the infinite interval with the weight- 
function e~^, essentially the polynomials of Tchebychef-Hermite, 
.or on the fundamental theorems of real analysis. From the 
latter point of view, the essential point in the passage to the 
infinite interval is merely that if any coefficient in P n (x) were 
large, the integral over a fimte interval would be large, and the 
integral over the infinite interval would be larger still, so that 
the coefficients to be considered in the search for a minimum 
belong to a bounded domain.) It is to be shown that under 
suitable restrictions on fix) the polynomial P n ix) will converge 
everywhere to f (x) as n becomes infinite, and will converge 
uniformly over any interval of finite extent. The aim will 
be to arrive at a result of this character in as straight¬ 
forward a manner as possible, without regard for the greatest 
attainable generality. 

Let Yn be the minimum value of the integral, and let a 
be an arbitrary preassigned positive number. Let pnix) be 
an arbitrary polynomial of the wth degree, let 

r»(a>) = fix)—p n ix), i T n ix) — Pnix) — Pnix), 

so that 

fix)—Pnix) — rnix) — TCnix), 
and let e n be the maximum of j r n (ce) | for —(«+l) «+1. 

Lot Pn be the maximum of | n n ix) | in the same interval, 
attained for x = xq. 
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By the third corollary attached to Bernstein's theorem on 
the derivative of a polynomial, there is a positive number H. 
independent of n, such that 

1 7i n (pc) — Tt n (x 0 ) Hn (t n \x — .r 0 12 


as long as x is in that one of the intervals (— a — 

(0, a+1), to which x 0 belongs. This condition will be 
satisfied, on one side of x 0 at least, for all values of n from 
a certain point on, if |a-— x 0 | <; l/(4JS‘ 8 « a ). So there will 
be an interval of length 1/(41? 2 »*) throughout which 

1 1 

I Kn(x) 7ln(x 0 )\ < Y^ 11 ’ ! 71 n O 35 ) I ^ 

If it is supposed that e n ^ \p n , then 

\f(x) — Pn (x)\ — | r„ (a) — 7T n (a*) j -j- fh, 


throughout the same interval, whence it follows that 


Yn e -(«+W J . 


aL 1 

16 ’ 4 ' 




where ci is independent of n. The alternative possibility 
that « n >i (*n may be left out of account until a later stage. 

Now let fi'n be the maximum of | n n (x) | for —a < x < «. 
It is certain that Let xi be a point in (—«. «) 

such that 1 7t n (xi) | = fi' n . By the second corollary of Bern¬ 
stein’s theorem for polynomials, there is an h independent 
of n such that 


I Ti' n (x) i < hnpn* i Tt n (x) — Ttnixi) i < hnfi n | a *—xi |, 

for — a<x<a. (It is clear that Ji must be positive.) The 
quantity 3 n = jt*»/(2Aw.|tt> n ) is less than a for values of n from 
a certain point on, since /*»</*«. One or the other of 
the intervals ( xi — 3 n , £Ci), (a?i, is then contained in 

(— a, a), and there is an interval of length at least through¬ 
out which 

t Ti n (x) — 7t n (xi) | fg Y^’ 1 ’ I n *( x ) 1 ^ 2 fi ' n ' 
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If £ n < Iv'ii' 

I fix) — JP/i (x) : — 1 r n (a*) — (sc) | > \ /«» 

in this interval, and 

with oa independent of n . The assumption that e n carries 

with it the condition s n i(* n , under which p n <,c x ny\ f 2 ? 
so that 

n i* n y„ < c t n 2 yf 1 , f n < c A y% 2 . 

If the hypothesis that £ n <£ \ is not fulfilled, /4<4«« 
immediately. In either case 

K ^ c s n 2 ' 8 y 1 r l 2 + 4e n> 

with the reservation merely that a finite number of values 
of n may have had to be ruled out in the course of the proof. 

As Pn and e n are upper bounds for | n n (x) j and | r n (sc) | 
respectively, 

|/(sr)~P n (.r) = j r n (sc) — n n (x) ] <£ c a n 2/s yV 2 +-5e» 

for —a <! # <; a. For fixed a, the polynomials p n {x) can 
surely be determined so that limn=.oo = 0 , since /(a?) is 
continuous. In order that P n {x) shall converge toward fix), 
uniformly throughout the interval (—«, a), whatever value 
may he assigned to a, it is sufficient that lim»=oo» 4 / 8 r»» = 0. 
It remains to discuss the order of magnitude of y n , under 
appropriate hypotheses on /(sc). 

There will be occasion to use a lemma regarding the 
behavior of a polynomial as its argument becomes infinite. 
Let p»(sc) be an arbitrary polynomial of the wth degree, 
»1, and let M be an upper bound for its absolute value 
in the interval — 1 <] sc < 1 . Let Xk(x), as in Chapter I, 
be the Legendre polynomial of degree k. Then pn(x) may 
be written identically in the form 

P* (sc) = Ao Xo(sc)■+ AxXi(tx>) -f- ••• -j- A n X n (sc ), 
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where 


-J* — — X k (x)dx. 


From the last equation, together with the definition of M and 
the fact that | X k (sc) | 1 in (—1, 1), it follows that 

'M ^(2*+l)3/. 

For | x | 1, on the other hand, the identity 


gives 


X k (x) = — | [£C-h (ic 2 —1) 1/2 cos <p] k dcp 

71 0 

|X ft (x)| ^ |2ar[ fe , 


since (x 2 — l) v -1 cos y| ^ |x|. Consequently, under the assump¬ 
tion still that \x\ > 1, 

\pn(x)\ ^ M\l-\-3\Zx\-\-5\2x\ i + (2«-4-l)!2x|*] 

Jh£\_ 1 —3 —5 —f— (2-w 1 )312 jtI” 

— M\2x\ n (w + n) s Jf|2xi“ = 4i/'n 2 |2a , j“. 


If p n (x) is a polynomial of the nth. degree, as before, and 
if M is an upper hound for \p n (x)\ in an interval — fi^x 
^ fi, where fi is an arbitrary positive number, then p n (x) 
is at the same time a polynomial of the wth degree in xffi. 
and for |x| fi, \x/fi\ ^ 1, 

jp»(x)| <j 4Hfw 2 |2£r/^|*. 

This relation, under the hypotheses stated, constitutes the 
lemma in question. 

Let it be assumed now that fix) has everywhere a first 
derivative satisfying the condition 

I/'(a*)— f(pi)\ ^ h\xa — Xi\. 

Let fi be an arbitrary positive number, and let Theorem VII 
of Chapter I be applied to the approximate representation 
of f(x) in the interval (— fi, fi)- The conclusion is that there 
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exists for each m >1 a polynomial p n (x), of the nth degree, 
such that 

ljf/ . , v . 2K % ft*l 

\f(x)—pn(x)\ < 


n- 


f or — ft x ft, where K is the absolute constant that 
figures in Theorem I of the first chapter. 

As an immediate consequence, 

Iifi) = < iK n t ^ §_ f e ~*' iX 

— nr J-oo 

or, as the value of the last integral is n 1 ^ < 2 , 

■ -• ' 

Furthermore, |p*(a;)| M x + 2 K 2 A 2 X/n* for — ft ^ x ^ ft, 
if M x is the maximum of \f(x) | in the interval. Let |/(0)| 
= «o, |/'( 0 )| = eh\ then \f'{x)\ <£ (h + X\x\, |/(aO| 

^ «o+ Ui |a;| + x s , and 

Mi ^ ao+axft + ~Xft\ 

(Incidentally it is seen that the present hypothesis carries 
with it the fulfillment of the earlier requirement that f(x) 
shall not become infinite faster than a power of as.) If ft ^ 1 , 
as will be assumed henceforth, M x (oo + ai + i^)/* 8 , and 


\Pn(x) | ^ Cift* 


for — ft ^ x ^ ft, where c 4 is independent of ft and of n, 
though it is different for different functions f(x). 

For \x\ ft, by the reasoning of an earlier paragraph, 

Lp»(aO| ^ 4c 4 y9 8 w s |2aj/^| w . 

At the same time (that is, for \x\ ^ ft 1 ) 

l/(®)l ^ ^ (ao + ai + Y^)^ 
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and by further enlargement of the right-hand member, with 
«s = «o+ «i+ i A for abbreviation, 

!/(*) I ^ «2 a; 3 = ± a 2 /3 s (2 x//3) s < ~ it s / 8 s n s (2 x/fi) s 

= £ 2 « 2 ; - #/j0 I"; 

it has already been assumed that n ^ 2. So, with a new 
multiplier c 6 , independent of /S and of n, 

1/(33) —Pn (a) I ^ (4 C 4 + ~ Oaj/S 8 % 2 I 2 W/SI", 

[/(a) — Pn (a )] 2 ^ c 5 w 4 1 2 a//S | 2w 

for | x | fi. 

This means that 

r / 9 \ 2 « 

e~ a? [fix) — p n (a)] 2 dx c 5 /S 4 n 4 ’a 2 ” e-^ d.v, 

the quantity on the right is an upper bound also for the 
integral from —00 to — fi. Since /3> 1, 

ccP^e"* 3 dx<C a? n x dx =^ y" 1 e~ y dy 

<-|-X y ne ~ yd y = Y r («+i) = Y nl ~ \ n ^ 1 ~ 1,! 

Without reference to more elaborate approximations to the 
value of the factorial, it is seen at once that 

J Vc+l /»fc+l 

k lo gkdt<J k lo gtdt, Jc = 1.2,---, 

log [(n — 1)!] = logl-{-log2-|-*—(-log(n—1) 

<J^Io gtdt — nlogw — n+ 1. 

in — 1)! < e ini e) n . 

Consequently 

-=T****VF7) ’ 
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and 



e~ ,r * [/(Jr) — Pn (a?)] 2 dx 




8 

w 4 


+ r 6 e/S 4 n 6 



The number ft has been arbitrary hitherto, except for the 
requirement that it be positive and not less than 1 ; the last 
relation is valid with the insertion of any such value of ft. 
you* let ft — 2 « 1/s . This makes 

tin <;-- h 16 (‘ 6 ew 7 

As n 9 e~ n approaches zero when n becomes infinite, n 7 e — " does 
not exceed a constant multiple of l/n 2 for w 2 £ 2 , while, on 
the other hand, g n is not less than the minimum value rn- 
Therefore 

v Cg 

Yn^ , 

with c 6 independent of n . 

The condition limn=oo n i!i y n = 0 being satisfied, P n ix) con¬ 
verges uniformly toward f(x) throughout any finite interval, 
under the hypotheses stated. 

As in the case of the Fourier and Legendre series, the method 
under discussion, when applied merely to the classical problem, 
yields little if anything that is new, and misses much that is 
well known. Clearly, however, the treatment admits a variety 
of generalizations, which remain open for further investigation. 
To mention just one, which calls for no additional labor, the 
reasoning applies without material change if the weight func¬ 
tion e~ xi is replaced by any positive continuous function which 
is never greater than a constant multiple of e — •**. The theory 
thus suggested has been developed at some length in a thesis, 
as yet unpublished, by J. M. Earl. Theorems on degree of 
approximation over an infinite interval, without reference to 
the particular method of this chapter, have been published 
by W. E. Milne in vol. 31 of the Transactions of the American 
Mathematical Society. 



CHAPTER IV 

TRIGONOMETRIC INTERPOLATION 

I. Fundamental formulas of trigonometric 
interpolation 

This chapter is concerned with certain striking analogies, 
both formal and analytical, between the theory of interpolation 
by means of trigonometric sums and that of Fourier series. 
The case of polynomial interpolation will be left out of con¬ 
sideration for the most part, since the analogies there, when 
the points used for interpolating are equally spaced, are 
rather with Taylor’s series than with those of Fourier and 
Legendre. Certain extensions to the ease of interpolation 
by means of Sturm-Liouville sums have been carried through 
by C. M. Jensen (Transactions of the American Mathematical 
Society, vol. 29 (1927), pp. 54-79). 

Let t 0 , ti, - • •, tan be 2n + 1 distinct numbers contained in 
an interval of length 2n, for definiteness that from 0 to 2/r, 
inclusive of the left-hand and exclusive of the right-hand end 
point. Let y 0 , yi, - • *, y*n be any 2n + l real numbers, 
distinct or not. Let the problem be proposed of determining 
a trigonometric sum T n (cc), of the roth order, to satisfy the 
conditions 

Tn (tr) — yr, r — O, 1, * * •, 2m. 

The sum T n (pc) has 2n-\~l coefficients, on which 2 n +1 
conditions are imposed. With the notation 

T n (pc) = Oo -+■ COS 35+ • * - + OLn COS HOC 

4- b t sin x + • - • -M>» sin nx, 

the conditions to be satisfied are given explicitly by the 
2w + l equations 


10f> 



110 


THE T HE ORY OF APPBOXIMATION 


Oo + fficostfo 4* 4 sin 4 4-j-ctn cos+&ftSin«io —3/o ? 

rto —}- cos fi -I - J>i sill ti -(-•••-j- cosm^i 4~ bn, sin n t± y± . 

^o4“®i cos4« -f"&i sin4» 4“ *4“ fl»cos nt$n 4" sinwfe« = y%n . 

which are linear in the a’s and b’s. One is confronted by 
the problem of showing, directly or indirectly, that the deter¬ 
minant of these equations, the determinant 


1 

COS to 

sin to 

• • • COS W to 

sin nt 0 

1 

cos 4 

sin 4 

• • • COS 71 t) l 

sin ntx 

1 

• ♦ * 

COS t%n 

sin hn 

• • * COS 71 fen 

sin nUn 


is different from zero. 

If a and /S are any two numbers, the product 
sin — (x — a) sin -^-(x — /S) 

is a trigonometric sum of the first order in x, being identically 
equal to 

T [ cos (4 “ — J fi ) ~ cos (* “ T “ ~T fi ) ] 

= Ao+At cos x4 "-Bx sin x, 

with Aq — % cos (i « — 4 fi) , A x — — £ cos (£ « 4- 4 fi), 

B t = — $ sin (i a 4- £ /$). In the expression 

9k (x) = 

sin $ (pc —<o)...sin^(a;—4-i) sinfr (pc —<fc+-i)---sinj(a; — fe n ) 
sin$(4—4)*--sin4(4—4-i)sin£( 4 —4+i)---sin£(4—4»)' 

the 2n factors of the numerator can be combined in pairs 
to give n expressions, each of which is a trigonometric sum 
of the first order, and the product of these is a trigonometric 
sum of the wth order; the denominator is a constant which 
is not zero, since the difference of any two of the numbers 4 
is by hypothesis different from zero and numerically less than 
2 ti . It is apparent on inspection that y* (4) = 0 for r 4= k, 
while g>k (4) = 1. Consequently the function T n (x) defined by 
the formula 
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2 n 

T n C®> = 2 2/a <Pl <*) 

A *=0 

/s a trigonometric mm of the nth order such that T n (f r ) — y,. 
r = 0 , l,--, 2 w. The linear equations above accordingly 
have at least one solution, for any assigned values of the y's. 
But if the determinant of the equations were zero, there would 
be values of the right-hand members for which there would 
be no solution. It is certain therefore that the determinant 
is different from zero. Whatever values are given to the ys, 
the proposed problem has one and just one solution. In par¬ 
ticular, if all the y’s are zero, the obvious solution in which 
all the a’s and Vs are zero is the only one. A, trigonometric 
sum of the nth order which vanishes at 2 n + 1 distinct points 
in a period is identically zero ; fuo trigonometric sums of the 
nth order which coincide in value at 2 n +1 distinct points of 
a period are identically equal. 

It is easy now to supply a proof of a fact which was pre¬ 
viously assumed as known, in connection with the proof of 
Bernstein’s theorem, namely that a trigonometric sum of the 
wth order vanishes identically if it has 2 n distinct roots in 
a period, one of which is a double root. Let T n if) be a sum 
of the Tith order which takes on the value 0 for 2 n distinct 
points of a period, say for x ~ 4 , 4, • • h n , but which does 
not vanish identically. Let to he a point of the same period 
interval, distinct from 4 , • • ■, and let y 0 = Tn (to) ; it is 
certain that y 0 + 0, since T n (x) can not have 2 n-\- 1 distinct 
roots in the interval. The expression 

sin £ (a: — 4 ) sin £ (a; — 4 )«♦ • sin £ (a; — tin} 

2,0 sin 4 (.to — 4) sin \ (4— 4) • • • sin (4— 4») 

s a trigonometric sum of the wth order which takes on the 
value y 0 for x — to, and vanishes for x = 4 , * • -» 4», and so 
must he identical with T n (x). The derivative of th i s expression 
is easily calculated explicitly; it is 

.__ 1 _ y sin $(x — 4 )—cos^Ca;—fe)—sinH^~4n) 

n 2 sin 1% (4— 4 )--sin^( 4 —4)• ■ • sin^(4 fe») 
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For sc = t r , if r has any of the values 1, • •*, 2 n, all the 
terms of the summation vanish; except the one for which 
k = r, and that becomes 

j __ sin—fi) -• -_1_ ••• sin^(4—fen) 

*" sin ■J (to fij • • • sin ■J (£o t ,) * * * sin ■J (to tin) 

which is certainly different from zero, so that Th (£»•)—i yo D r =£0, 
the sum T n (x), assumed not to vanish identically, can not 
have a double root at any of the points ti, ♦ •*, few 

To return to the problem of interpolation, let the points t, 
from nmv on be supposed equally spaced over a period, As 
a matter of general notation, if m is any positive integer 
and r any real integer, positive, negative, or zero, let 

U = 2rn\m. 

The discussion hitherto (apart from the digression of the last 
paragraph) is applicable on the assumption that m is odd, 
m = 271+1; there will be occasion subsequently to consider 
even values of m as well. Throughout the remainder of tins 
chapter, in the absence of express indication to the contrary, 
the sign £ unll be understood to refer to summation with 
respect to the index r, from » = 0 to t — m —1, or what 
comes to the same thing, as all the functions considered will 
be of period 2n, over any m successive values of r. 

It is a fundamental fact, whether m is odd or even, that 

S sin kt r = 0 
if it is any integer, and that 

£ cos k tr = 0 

if % is an integer not divisible by m, while if Tc is a multiple 
of m (and in particular if k — 0) it is evident that 

y cos ktr— m, 

since in this case each term of the summation is equal to 1. 
The truth of the statement with regard to sin ktr is 



IV. TRIGONOMETRIC INTERPOLATION 


113 


apparent from considerations of symmetry. The term sin ktois 
equal to 0. Since 

Jcttn—r == 2 k(m — r)n/m — 2k n — kt r . 


each term corresponding to an index r between 0 and \vi 
is paired with a numerically equal term of opposite sign. 
If m is odd, all the terms are thus accounted for; if m is even, 
the term which remains by itself is sin kt m /2 — sin kn = 0. 
In connection with the cosine sum it is to be recalled that 


1 

2 


4- COSM-f cos 2m + 


+ cosntt — 


sin (n 4~ ti 
2 sin^w 


(The right-hand member is understood to be defined by con¬ 
tinuity at points where the denominator vanishes.) Tiu — kU, 
then ru — kt r , and 


-f-COSfcfr-f-COS&lfeH- ••• -(-C0S&#n 


sin 

2 sin ti 


But by the symmetry pointed out in the preceding paragraph, 
the terms on the left are respectively the same as those of 
the sum 

~ 2 ~ "1“ C0sA£ m _i4" COSkim— 2 4" * * * 4" COSktm—n~ 


If m — 2n-j-l, and if k is not divisible by m, addition of 


the two sums gives 

COS ktr 


sin(n + ^)fc^ 
sin ^ kt L 


and in the last expression the numerator is zero, since 
ti = 2n/m and n + 4 — \yn>, so that (n 4- i)ki t = kn, while 
the denominator is different from zero. If m — 2w, the tern 
cos kin = coswfe^i occurs in each sum; combination of the 
two, with subtraction of the redundant term, gives 


^cos&4 = 


^i±mk _ C0SBtt , 

sm f kti 

sinwfci 1 cosiJfc#i4'COSwft<isinilc< l _ 7>J 

— -:—- - -COS fin ti 

sin£fcii 

sinw&tfi cot—-kti == sinfewcot(Arar/»n) = 0, 


the assumption being still that k is not divisible by «*. 


s 
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The proposition of which the preceding lines give an 
analytical proof is almost evident geometrically, though 
a precise formulation of the geometrical argument requires 
a little attention to detail. The pairs of numbers (cos 4 , sin4), 
r = 0 , 1 , • • -, m — 1 , are the coordinates of the vertices of 
a regular polygon of m sides having its center at the origin, 
and the quantities ( 1 /m) 2 cosir, ( 1 / m) £ sink, being the 
coordinates of the center of gravity of these points, must 
evidently be zero. The more general fact of Hie vanishing 
of 2 cos *4 and sin ft 4 is then obtained by consideration 
of the various possibilities as to the existence of common 
factors of Jc and m. 

In the following statements, let p and q be integers subject 
to the restrictions 0 < p < £m, 0 g \m. Then, in the 
identities 

cospa: cos qoc = -^-[cos(p— 2 )* + cos(p + g)#], 
sinpa: singa: = ~-[cos(p— q)x —cos(p + g)a?], 
cospa; sin qx = i-[sin (p + 2 )*— sin (p— q )»], 


p — g and p-j -2 are integers numerically less than m, except 
that p-f 2 = m if m is even and p = q — so that 


J£Jcosp4 cos 24 = 0 
sinp4 sin 24 = 0 
2/C0sp4 sin 24 — 0 
^cos a p4 = ^sin*p4 = -|-m 

S cos 8 pt r = m, y j sin 8 p4 = 0 


if P # q, 
if P + 2, 

for all p and q, 
if 0 <C p < m, 

if p = 0 or - 5 -m. 


Until the contrary is stated, let it be supposed now that 
m is odd, m — 2n+l. The equations for the interpolating 
coefficients given jn the second paragraph of the chapter 
can be solved explicitly, under the present hypothesis of 
equal spacing of the points 4* It is known in advance that 
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they have one and just one solution. If the 2 m+1 equations 
are added as they stand, the left-hand member in the result 
reduces to (2»+l)ff 0 . so that 

(As to the convention with regard to the meaning of the 
sign 2, it is to be understood here that the summation is 
performed specifically over the range r — 0, 1, •• *. 2n, or 
else that y r is defined outside this range as a periodic 
function of the index by the prescription that y r + 2 »+i = yr-) 
To determine a*, 0, let the (r+l)th equation be mul¬ 

tiplied by cos Jct r , r = 0 , 1 , • • •, 2n. On addition of the 
equations thus obtained, the left-hand member in the sum 
becomes cos a /c# r = + whence 

«* = 2^TT^ rC0S& * r - 

Similarly, 

bk= 2^fr2 yrSinktr ' 

If the notation is modified to the extent of representing the 
constant term in the interpolating sum by Oo/2, instead of cr 0 , 
the general formula for au gives the correct value of ao also. 
It will be understood henceforth that the notation is adjusted 
in this way, and the interpolating sum for equally spaced 
points t r will be denoted by S n (x), so that 

*S n(x) — -|-ao + ai cosjc-j- ••• -4-a»coswa; 

+ 5i sin as+ • * • -f &» sin nr, 

the a’s and Vs being given by the two preceding equations 
for k = 0,1, • • n. The resemblance to the Fourier coeffici¬ 
ents is apparent. 

Even if it had not been known a priori that the equations 
have a unique solution, that fact would be an immediate 
consequence of the work of the last paragraph. For the 

8 * 
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work shows that if the equations have a solution, it must he 
the one indicated, that is, they can not have more than one 
solution for any given set of y’s. But if the determinant 
of the equations were zero, there would he values of the y* s 
for which there would be infinitely many solutions. This new 
proof of the non-vanishing of the determinant, however, 
depends essentially on the assumption that the fs are equi¬ 
distant, and so is less general than the one previously given. 

The formal resemblance to the case of Fourier series is 
further borne out by substitution of the values of the a’s 
and Vs in the expression defining S n (a?), and rearrangement 
of the result by means of the identity for a sum of cosines: 

JS n (x) = y ^ (coskUcosJcx+ sinfe^-sinfejc)J 

“ 2 ^Fx • 2>[t+ 

= 1 v.,. sin (n + i)(k— x) 

2n-\-\^ Jr sin 4 (tr— x) 

Incidentally, the correctness of the last expression as a solution 
of the problem of interpolation can be verified directly, by 
substituting x — t g , for any particular value of q. For 
U-t q =2(r -q)n/(2n+l), sin (n+i)(tr—tq) = sin (r—g) rc = 0 , 
and all the terms of the summation vanish except the single 
one with a vanishing denominator, while the limiting value 
of the quotient of sines in that term is 2 w+l, so that 
S n (,t<[) ~ y q . 

Now let the problem be changed by supposing that m is 
even, m = 2n. Corresponding to the 2 n abscissas 4,---, hn-i, 
let 2 n numbers yo, • • •, yzn-i be given, and let y r be defined 
for other values of r so that y r + 2 n = y r - Consider the 
question of the existence of a trigonometric sum T n (x), of 
the nth order, represented for the moment by the notation 
of the second paragraph of the chapter, to satisfy the con¬ 
ditions T n (tr) = y r , r = 0, 1 , •••, 2 n — 1 . It is not to be 
expected that the problem will have a definite solution, since 
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only 2 n conditions are imposed on the 2n +1 coefficients. 
The precise nature of the indeterminacy is brought out by 
going through the formal manipulation of the equations. 
Written out at length, these are as follows: 

«o + «i cos t 0 + ft, sin fe 4-f* (in cos n 4- b n sin n t„ 

a 0 + «i cos fe + &! sin fe 4 - h «» cos n fe 4 - K sin n fe 

= .Vi- 


«0 4- cos fen— 14~ sinfe„_i4— 4" tin cosMfen—14~ &«sin n fe»—i 


Direct addition of them gives 

2 na 0 — 2i/r, a 0 = C£y>)/2«. 


— y^n—l • 


Multiplied respectively by cos&fe, cos&fe, •• >, cosfrfe«-i. for 
any value of Jc from 1 to n — 1 inclusive, and then added, 
they yield 

ah = (l/ri)]£l/r cosktr. 

and similarly 

h — (lln)^y r sinA'fe. 

So far there is no ambiguity, and no formal difference from 
the case previously treated, except the replacement of 2 n 4-1 
by 2w. When the (r4~l)th equation is multiplied by cos nt,. 
there is a difference because in the present case n — 
and 2cos s «fe is equal to m — 2n, instead of Zm. The 
determination is still perfectly definite, however: 

On = C? cosntr)/(2n). 


But as tr — rn/n, the numbers sin n t r are all zero, and the 
use of these quantities as multipliers does not lead to any 
determination of 6» at all. 

It is now apparent how the problem of interpolation with 
an even number of equidistant abscissas can be formulated 
so as to have a determinate solution. If S n (x) denotes an 
expression of the form 
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1- «o + <h cosaH-f ctn-i cos (n — 1) x-\- y a n cos nx 

+ & sin x 4-1- h,i-i sin (n —1) x, 

the conditions /S»(4) — y r are expressed by 2n linear equations 
for the 2« coefficients. The solution of these equations, if 
they have a solution, is given uniquely by the formulas 

a,k = (l/ri)]>JyrC0s7ctr, 6 fc = (1/n) £ y r siD.Jct r , 

for k — 0, 1, • • •, n. But if the determinant of the system 
were zero, there would be values of the y’s for which more 
than one solution would exist. So the value of the determinant 
is certainly different from zero, and the problem has the 
unique solution indicated. As sum 4- = 0 for all values 
of r, the expression 

S n (x) 6» sin was, 

with any value of 6», likewise takes on the value y r when 
x — U m , the indeterminacy of the problem as first proposed 
consists merely in the complete indeterminacy of the coef¬ 
ficient 6». The subsequent discussion will relate to the sum 
S n (x) as defined at the beginning of this paragraph, with¬ 
out the term in sin nx. 

For deriving a concise expression for S n (x), to be sure, 
it is convenient as a matter of form to include a term 
^ ft,* sin nx, the coefficient 6» being defined by the general 
formula for 6», with k — n; as each term of the summation 
vanishes, this gives b n — 0 automatically. Then it appears that 

^ r ^ n~~ i j -i 

&»(*)=— [y + cos Jci.tr — x) 4- y cos n(tr — x) I, 

or, as 

y + COSM-j- + COS (n —l)M+yCOSMM 

1 . . 1 
— y sm nu cot y u, 

by subtraction of £ cos nu from the identity previously used, 
S n (aj) = ^y r sin n(U — x) cot y (4-— x). 
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As in the case of an odd number of points, substitution of 
x ~ t q gives a direct verification of the validity of the formula 
for purposes of interpolation. The analogy with the partial 
sum of a Fourier series, however, is not superficially so much 
in evidence as before. 


2. Convergence and degree of convergence 
under hypotheses of continuity over entire period 

For the discussion of the analytical properties of the 
interpolating sums S n {x), particularly questions of convergence 
as n becomes infinite, let the given numbers y r be values 
of a function having specified properties: let fix) be a given 
function of period 2 n, and let y r — fit,) . Then S n (x) 
coincides in value with fix) at the points t r , and the question 
is to what extent S n (x) furnishes an approximation to fix) 
at intermediate points. 

In the first place, let fix) be a bounded function having M 
as an upper bound for its absolute value. Then a corre¬ 
sponding upper bound can be assigned for l^nCar) j. Let 
m — 2n-\- 1. If x has one of the values U, 1 S n (x) | = \f(U )' 
< M. If a? is not one of the numbers U, let tn be that one 
of these numbers which is nearest to sc, or one of the two 
nearest^ if two are equally near. Then 

tB ~ 2 ^ft £ ^ **+ jfpr- 


Because of the periodicity of the functions concerned, as has 
been noted already, the summation with regard to r in the 
formula representing 8 n (x) can be extended over any 2n +1 
successive values of r, and in particular from JR—n to U + n 
inclusive. Accordingly 


8 n (x) 


2w+l 

M 


R+n 




sin (w-j- j) (t r — x ) 
sin (tr — x) 


— 2n+l r^R—n 


sin jn + £) (tr—x) 

sini(tr— x) 
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The absolute value of the last expression in bars never 
exceeds 2n + 1 , as may be seen from its interpretation as a sum 
of cosines. This observation will be sufficient as far as the 
three terms corresponding to the indices r = R —1, R, iB + 1 
are concerned. The numbers fa+- 2 — x, £r +3 — x, • ••, tn+n—or 
are respectively greater than 2 tt/( 2 w+ 1 ), 4tt/(2m + 1)> • •* 
. 2 (n — l)nl(2n-\-l), the same thing may be paid of the 
sequence of numbers j fa- 2 — x\, ! fa-& — x\, • ••, ] fa- n —x |; 
and for all the values of r in question, | U — x\ <C it, so that 
| sin -J (fr —a?) | 2? (1 /tt) [ t r —.r ]. Hence 


sin (n + (t r — x) 
Sin 4 (4— X) 


3 (2 n -f-1) -j- 
< 3(2w + l) + 


/ V . yj l \| sin(tt + £)(k- x) 
l =R-n . ^R +2 !\ sin I ( u — x) 

. JR—2 x ^ 

=5f 3 ) I sin £ (tr — x) | 


( i2—2 jB-j-jj , - 

2 + 2 )ti -—r 

r=JR—n r-=JS-j~2/ | tr 

< 3{2w+l)+(2w + l)2;i = 4(2« + l)-f(2w+l)S — 

j=i 3 j =2 a 

< 4(2»+l)+(2w+l) 4 (2n-\-l) J r(2n-[- 1)log (n —1) 

wi 


< (2 n-\- l)(4 + log n), 
and 

I S n (x) | ^ M (4 + log n). 


The last parenthesis does not exceed a constant multiple of 
log n, for n^>2, and | S n (x) [ does not exceed a constant 
multiple of Mlogn. 

Similar reasoning, with minor differences of detail, applies 
when m — 2n, the sum S n (x) then being given by the formula 
of the third paragraph preceding. The conclusion may be 
stated comprehensively for both cases as 
Lemma X. If f(oa) is a function of period 2 n satisfying the 
condition that 

1 /( 3 ) j <; m 
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for all values of x, and if S n (r) is the interpolating sum of 
the nth order for fix) an responding to the subdii isiou of 
a, period either into 2»+l or into 2 n equal parts, then 

i S n U-) | < CM log n 

for n >1, where C is an absolute constant. 

The representation of the constant by the notation used 
in the lemma preceding Theorem IX of Chapter I does not 
imply that the constants are the same, though of course both 
lemmas could be stated together, with a single symbol to 
represent the larger of the two constants. 

For the application of the lemma, it is to be noted that 
i f{x) — S n {x) | M-\- CM logn, which is likewise not greater 

than a constant multiple of Mlogn, say BM log n, that the 
interpolating expression corresponding to the sum of two given 
functions is the sum of the interpolating expressions con¬ 
structed for the two functions separately, and the error of 
the sum is the sum of the errors; and that if m = 2 n -f 1 
the interpolating sum S n (x) formed for a function fix) which 
is itself a trigonometric sum T n (sc) of the nth order is iden¬ 
tical with T n (x), since by reason of the interpolating property 
S n (x) and T n (x) are trigonometric sums of the nth order 
coinciding in value at 2m + l distinct points of a period. 
Taken together with the lemma, these observations yield 
Theorem I. If f{x) is a function of period 2 rt, if S n (x) 
is the interpolating sum of the nth order for fix) corresponding 
to the subdivision of a period into 2?i + l equal parts, n> 1, 
and if there exists a trigonometric sum T n ix), of the nth order, 
such that 

\ fix)—Tnix) \ ^ 

for all values of x, then, for all values of x. 

I/Os) — Sn ix) I ^ Be„ log n, 

where B is an absolute constant. 

When m = 2n, the interpolating sum of the rath order, 
as defined above, for a trigonometric sum Tnix) of the nth order 
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is not in general the same as T n {x), since the interpolating 
sum does not contain any term in sin nx. But the inter¬ 
polating sum S n +-i(x) of order n + 1, obtained by taking 
m = 2W + 2, is identical with T n (x). For the coefficient 
Un-Pi in S }l +i (x) , given by the expression 

[l/(n + 1J] 2T n (t>) cos (» +1) tr, 

reduces to zero, since each of the sines and cosines in T n (as), 
through sin nx and cos nx, is orthogonal to cos(w+l)aj for 
summation over the finite range in question; and it follows 
that Sn+i (. x ) and T n (x) are trigonometric sums of the nth order 
agreeing in value at 2» + 2 distinct points of a period. The 
above statement may therefore be supplemented as follows 

Theorem I {continued). If 8 n +i (x) is the interpolating sum 
of order n +1 corresponding to the subdivision of a period 
into 2n + 2 equal parts, the hypotheses remaining otherwise 
unchanged, then 

\f{x) — S n +i (x) ; ^ B e n log (w +1), 

where B has the same value as lefore. 

As log (« +1) < 2 log n for n > 1, the right-hand member 
may be replaced by 2 B e n log n . As an alternative, n may 
be replaced by n — 1, to give 

| f{x) — S n (x) j < B e n —i log n 

for m = 2 n. 

Like Theorem IX of Chapter I, the present Theorem I can 
be combined with Theorems I-IV of Chapter I to give a suc¬ 
cession of more specific results. The cases of an odd number 
and of an even number of subdivisions can be covered by 
a single formulation each time, by virtue of the observation that 
l/(n —1) 2 Jn and oo[27t/{n —1)] ^ 2oo{2n/n), 

for 1. The symbol A is merely a notation for the largest 
of what would be obtained in the first instance as a finite 
number of different constants, and Ap similarly: 

Corollary I. If 

i/0»3s) — /(aft)I ^ ^\xs —Xx\ 
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for all values of and x», 7. being a constant . then 
1 /O) — S n (a-) | < — 

— 71 

for mi = 2w + 1 and for m = 2n. 

Corollary n. If fix) is continuous inth modulus of <on- 
tmuity w(<5), 

: /(a?) — 8 n (a?) ^ A a {ppj log n 

for m = 2n -\-1 and for m — 2n. 

Corollary Ha. The interpolating sum Sni-r) converge'" 
uniformly to the value fix), as m becomes infinite through 
orld or even values (or both), if fix) has a modulus of con¬ 
tinuity <o(d) such that linv =0 co(d) log 3 = 0. 

Corollary HE. If fix) has a ptli derivative f p) (x) such 
that 

|/ cp) fo)— /<*>(:*)! ^ Mxs-x, 
for all values of Xi and x s , X being a constant, then 
I fix) - «„(*)' ^ 

for m — 2 n + 1 and for m — 2 n. 

Corollary IV. If fix) has everywhere a continuous 
pth derivative with modulus of continuity w(3), 

1/6*0 ~Sn(x)\ < {pjp log n, 

for m — 2n +1 and for m — 2n. 

In each of these statements, the conclusion holds for all 
values of x, and for all values of n^ 2; the coefficient A is 
an absolute constant, ivhile A p depends only on p. 

3. Convergence under hypothesis of continuity 
over part of a period 

The next proposition is analogous to a result obtained in 
the second and third paragraphs of Chapter II: 
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t.kmma II. If fix) is a function of period 2n, bounded 
and integrdble m the sense of Utemann, 

lim(l /m)2f(.t*) cos nt, — lim(l /m)^f(t r ) sin ntr = 0 , 

as m becomes infinite through odd values (w* = 2n-\-l) or 
even values (m — 2n) or both. 

There is a noteworthy difference in the hypothesis imposed 
on fix); it is in the nature of the case no longer sufficient 
to assum e merely that the function is summable, or summable 
with its square, since an enumerable set of very large values 
of fix), while not affecting the definite integrals of the 
earlier chapter, might throw the present sums entirely out of 
proportion. 

Suppose first that m = 2n+ 1. Let the summation be 
extended for definiteness from r — 0 to r = 2n. Since 
t r — 2m/i2n-{-l) l 

. 2rnn 2r(« + 4)?r — rn rn 

nt r — t:— j— 7 - =- . -= rn — —— r -~, 

2 w-f 1 2n -\-1 2 n + l 

V 7T 

COSH tr = (— l) r COS , 

j 2n 

1 y 1 2 n I \ 2jn 

2 tv j —0 2 h + 1 \ 2 w + 1 / 2n -\-1 

1 ^ 2 tv [2 (2j — l)?r\ (2j — l)n 

—5S ,S ^+r / l~ 2 - M +l J cos 2«+l ■ 

When 11 becomes infinite, the first sum in the last member 
approaches 

(*7t 

J 0 f (2 u) cos u du, 

by the definition of integrability, and the second sum has 
the same limit, so that the whole expression approaches zero. 
(Neither sum as it stands is exactly of the form that would 
ordinarily be written down in defining the definite integral, 
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but the discrepancy in each case amounts to half of a single 
term, and approaches zero in the limit.) As 

sin nt r = (— 3)' + 1 sin , 

similar reasoning applies to the expression 
[ 1/(2 w + 1 )] 2 / (*r) sin nt t . 

If m = 2 n , the summation going from r — 0 to r = 2 n — 1, 
t r = rn/n, coswf, = cosr/r == (—l) r , 

.2/(40 cos »4 

_ _1_ V " W ^ jrr /(2?-l)7r \ 

2n jt? 0 n J \ n ] 2n J tf 1 n J \ n /* 

Each sum in the last member approaches 

J^/(2m) du, 

and the whole expression approaches zero. As for the other 
expression in the statement of the lemma, sinw U —0 throughout. 

Suppose fix) is a function of period 2 jt, bounded and 
integrable in the sense of Riemaun, which vanishes for 
r„— ij <x-^ x 0 + ij. It be seen that 8 n (cc 0 ) converges 
to the value 0 , whether the values of m entering into the 
definition of the sums Snips) are even or odd. In both cases. 
S n (x) can be represented, artificially but with readily verifiable 
accuracy, by the single formula 

SJjc) — 1 2/(40 sinner — x) cot-^-(4- — x) 

+ t 1 “ (— i)” 1 ! 2/(40 cos nitr — x). 

As cosw(4-— «)==coswa; cosw^+sin nx sin«4-, the last 
sum, taken with the factor 1 /( 2 m), approaches zero uniformly 
for all values of x. And as fit) cot ^ (t — x 0 ) is bounded 



126 


THE THEORY OF APPROXIMATION 


and integrable in the sense of Riemann as a function of t, 
the first factor vanishing where the second factor is large, 
application of Lemma H to this function shows that the first 
part of the expression for S n (x) approaches zero for x — x 0 . 
If two functions, each hounded and integrable in the sense 
of Riemann, are identical from cc 0 — >/ to a? 0 + 7 , the difference 
of their interpolating expressions converges toward zero at x 0 ; 
if f(x) is any function of period 2 tv, bounded and integrable 
in the sense of Riemann, the convergence of the corresponding 
interpolating sums S n (x) at any specified paint, as m becomes 
infinite through odd os' even values, depends only on the 
behavior of f(x) in the neighborhood of the point m question. 

Now let f(x) be of period 2n, bounded and integrable in the 

sense of Riemann, and identically zero for a— q ^ x <- 

and let attention be directed to the problem of showing that 

S n (x) converges toward zero uniformly for a x fi. It 

was noted in the last paragraph that S n (x) either is given 

identically by the expression 

1 X 

m zJ f(tr) sin n(U — x) cot (t r — x), 

or differs from it by an amount which approaches zero uniformly 
for all values of x as n becomes infinite. Furthermore, the 
terms resulting from the expansion of sin n(U — x) may be 
considered separately, and as the factors sin wo; and coswa; 
are hounded, it is sufficient to demonstrate the uniform con¬ 
vergence of the expressions 

COS ntr cot ~(.tr—x), 

YU* & 

~2f(tr) sin ntr cot-| -(tr — x), 

as a? in the argument of the cotangent ranges over the inter¬ 
val («, fi). 

Suppose in the first place that m — 2w+l. Let x he 
restricted to the interval Let C(x, f) be a function 

which vanishes when 1 1 — x j differs from an integral multiple 
of 2rt by less than and is equal to cot \ (t — x) elsewhere. 
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Then f{t) C(x, t) is identical with f(f) cot£(#— a), under the 
restriction imposed on x, while C(x, t) is bounded for all 
values of x and t, never exceeding cot \n in absolute value, 
and is JS-integrable as a function of t. By adaptation of 
the formulas of an earlier paragraph, 

[2 tt/(2w +1 )] 'Zfit,) cos ntr€{.x, U) 


is equal to 


~sr 2tv f ( 4/?r \ 
A 2n-\-l J \2n+l) 


cos 


2 j?r 
2 w + l 



jjJL \ 
2n + l) 


minus another expression of similar form. Let uj=2jn/^2n-{-l). 
j = 0, 1, • ••, n, and let Auj = 2n/(2n-\-l) for j = 0, 1, •••. 
n — 1 , while Au n — n/(2n-\-t). Then the sum just written 
down differs merely by the quantity 


n „[ Ann \ 2nn 
2n+l^\2n + l) COS 2« + l 



Ann \ 
2 n+l/’ 


which approaches zero uniformly for all values of x , from 

n 

2 /(2m,) cos uj C(x, 2uJ) A uj, 
j=o 

a sum corresponding to a subdivision of the interval (0, n) 
into w+l parts, not all of equal length, and defining the 
definite integral 

J>«> cos uC(x, 2u ) du 

in the limit. 

The essential point for the present argument' is that the 
sum differs from the integral by an amount which approaches 
zero uniformly with respect to x. Let Mj and m, be the 
least upper bound and the greatest lower bound of the 
integrand over the/h sub-interval, Pj and pj the corresponding 
bounds for the factor /(2m) cos m, and Qj and qj those for 
the factor C(x, 2 u). The difference between the sum and 
the integral does not exceed 

IE (Mj — mj) Attj. 
j=o 
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Let M be the least upper bound of j/(2?t) |, as u varies 
without restriction, anupperbound consequently for | f(2 u) cos u |, 
and Q(= cot by) the least upper bound of | C(x, 2u) !. Then 

Mj — mj <: M{Qj—qj)+Q{Pj — pj), 
since for any u x and of the sub-interval in question 

/(2m 2 ) cos u s C{x, 2us) — f{2vf) cos u x C{x, 2u x ) 

— /( 2us) cos ii s [C(x, 2us) — C(x, 2%)] 

-\-C(x, 2u x ) [ f(2ns) cos« 2 —/(2*fri) cos Ux\. 


As f(u) is by hypothesis inferrable in the sense of Riemann, 
/(2m) cos u is JS-integrable also. Let e be an arbitrary 
positive quantity. By the property of integrability (more 
specifically, by Darboux’s theorem) it is possible to choose n 
so large, and the intervals Auj in consequence uniformly so 
small, that 

-2 (Pj —Pj) Auj< Q (Pj —pj) A uj < ~ . 

On the other hand, 

£(«;-?,) 

J = 0 

can not exceed the total variation of C(x, 2u) as u ranges 
over the interval (0, n), a total variation which is finite and 
independent of x, being equal to 4 Q. Hence if n is taken 
so large that 2n/(2n-\-l)<e/(SQM), it follows that 

ZM(Q J -q J )Au J <~. 

j—o & 

So there exists an N, independent of x, such that for w iV 

n 

Z(Mj — wij) Auj<C.e. 

j=o 

The other sum analogous to the one just treated is 


T 2 ” f / 2(2? —l)7t \ 

J £i 2n + l J \ 2n-fl / 


(2j-l) 
2 n + 



2(2y—l)*r \ 
2n + l /' 
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By addition of the term 


rr / 2 (2 n — 1) rr \ 

2 «-fl / \ 2 «+I / 


cos 


(2 n —1 ) rr 
2 « + 1 " 



2(2 « —1 ). t \ 

2ii-fl~~r 


■which approaches zero uniformly, this becomes a sum ap¬ 
proximating the same detinite integral as before, and corre¬ 
sponding to a subdivision of the, interval of integration into 
n — 1 parts each of length 2 .t/(2 m + 1 ) and one part of 
length 3 jt/(2w + 1 ). The difference between the sum and 
the integral approaches zero uniformly, and the difference 
between the two sums does the same, which means that 

2f(t,) cos n t, cot y ~ J”) 

converges uniformly toward zero. The corresponding ex¬ 
pression with sinwf, in place of cos nt, can be treated in 
the same way. This completes the proof of uniform con¬ 
vergence of (S',i (x) for odd values of n. 

For m — 2n, the discussion of the sum containing cos n t r 
is slightly simplified in form by the fact that cos nt, reduces 
to (—l) r , and by the fact that 2 rtfm is an exact submultiple 
of rr, but otherwise follows essentially the same lines as 
before. The sum with sin n t r vanishes identically for all 
values of n, since sin n i r — 0 . 

The conclusion is embodied in 

Lemma IE. If fix) is a function of period 2 rr ichicJi is 
bounded and integrahle in the sense of JEtiemann , and which 
vanishes identically for rc — V ^ a ^ the corresponding 

interpolating sums S n (x) converge uniformly toward zero foi 
a <- ft, as in becomes infinite through odd or even inJues, 
or both. 

On the formal side it may be pointed out in passing, and 
might have been noted before, that for m even S n (x) has 
the alternative representation 

— —— sin n xSj f(tf) cos n t r cot -x- (U — ^ • 


9 
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though the appearance of sin nx as a factor of the whole 
expression is illusory to the extent that corresponding to 
each value of x for which sum a.- = 0 there is a factor 
cot — re) which becomes infinite. 

L emm a III may be combined at once with Corollary II a 
of Theorem 1 to give 

Theorem]!. If f(x) is a function of period 2 tc, bounder! 
and integ ruble in the sense of Riemann over a ye? iod, and 
continuous for «—// ^ r < ft -f- tilth a modulus of ioh- 
tnvuity «(d) such that lim^ =0 o»(d) log d = 0, the correspond!nr/ 
interpolating sums S n (x) converge uniformly toward f(x) for 
a ft* as m becomes infinite through odd or even values, 

or both. 

4. Convergence under hypothesis of limited variation 

There are theorems of convergence for functions of limited 
variation, analogous to those obtained in the case of Fourier 
series. Let f(x) he a function of period 2 it, with limited 
variation over a period, and let x have a value, to he regarded 
for the time being as fixed, such that f(t) is continuous for 
t — x. It will be shown that 8 n (x) converges toward/(re) 
for the value of x specified. 

Let gpi (<) and gp s ( t ) he the positive and negative variations 
of f(t), measured from the point t = — 2tt, say, so that 

fit) = /(— 2 re) -f- 9l ( t ) — cp s (t ), 

while spi and are monotone increasing, and continuous for 
t — x. (The point x may of course be thought of without 
loss of generality as belonging to the interval (0, 2nr), and 
then there will he no occasion to take account of values of 
t less than — 2n.) Consider the case ?» = 2»+l. If x 
is one of the numbers U, for a specified value of n, S n (x) = fix) 
exactly. Otherwise, let tn be the number U which is nearest 
to x (or one of the two such numbers, if’ two are equally 
near), so that 

tx ~ 2n + l ^ ^ ! *+ 2'n+l ' 
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Let the summation over r in the formula representing' (y) 
be extended from J? — ntoR-\-n. The corresponding formula 
with f(U) replaced by 1 represents 1 identically, since 1 is 
a trigonometric sum of order zero. If th*is identity is multi¬ 
plied by the quantity f(x), independent of r, and the result sub¬ 
tracted from the formula for 8 n (r), the difference S„ (.r) — f(jr) 
is obtained in the form 


8 n (x) —f{x) — 


2n-\-l r=R—n 


lam—/ uo] 


sin(w+^Kf> —.cl 

ton$(?r—a”) 


The right-hand member is equal to the difference between 


2w + l r^R-n 


[gpi (U) — spi (x)J 


sin (n -j- (fr — o) 
sin i (t r — .r) 


and the similar expression with $p a (fr) — 9 >a(x) in place of 
'Pi (t r ) — 9i(x). 

The quotient of sines never exceeds 2w + l in absolute 
value. In the term corresponding to the index r — R, 1r 
approaches x as n becomes infinite, and y> t (t R ) approaches 
<p y (x), by the continuity of cp y , and the whole term, multi¬ 
plied by 1/(2« +1), approaches zero and need not be further 
taken into account; it is sufficient to consider the sum from 
R -j-1 to R-\-n and the corresponding sum from R — n 
to R — 1. 

Since t R +j — x — t R — x + [2j tc/{ 2 n +1)], 
sin fn + y) (ts+j — x) = sin £ (n + y ) (t R — x) +j n J 
= (— 1) J sin {n + y) (t R — x), 


and is independent of j, except as to algebraic sign. If 
ilfn+j— x) is denoted by u j7 and if c stands for 0 or 1 
according as sin (n + £) (t R — x) is positive or negative, 


2 n ■+ 1 r^B+l 


[ti( 4) yiO®)] 


sin (n + (U — x) 
sin £ (tr — x) 


= (— D' 2 ?(— 
1 


9* 
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where 

A; = </’i U + 2 i<j) — gr>i l.r ), 


4 


|sin(n + — ag) | 

(2 w+Dsin iij 


The numbers Aj . Bj are all positive or zero. As j increases, 
Aj increases or lemains unchanged; Uj is always between 0 
and rr/2, sm Uj increases, and Bj decreases. Let V be the 
total variation of f (a - ) over an interval of length 2 n Then 
the A's have the upper bound \ V, since the positive and 
negative variations of a periodic function over a period are 
each equal to half the total variation; while B J} which is 
equal to the absolute value of 

sin(n + 4) (tx+j ~ ■ r ) 

(2 n + 1) sm £(£??+;— or) ’ 
can not exceed unity. 

After the analogy of the proof in the case of Fourier senes, 
where the second law of the mean was employed, it would 
be natural here to use the method of summation by parts. 
The formulation can be slightly simplified, however, by reason 
of the fact that the sequences A Jt Bj are loth monotone, 
though varying in opposite senses. In general, let a i} ■ • •, a p , 
bit •••} b p be two sets of p numbers each, satisfying the 
conditions 

cti ^ Ug ^... ^ cip ^ 0, hi ^ 1 2 ^ ... Ip ^ 0. 

Let Cj= bi — bj, so that 


0 — Ci ^ Cg ^ ... ^ Cp < ^~~~ bi . 

Then 

( IV dj lp—j+i == cii bp — fig bp —i -\ - 

j=i , 

— Oh Cp) — (& i Cp —i) -f- • • • 

= hi (<Xi «s H— •) — (cii Cp — 0.2 Cp —i -f- • • •). 


Since both the a’s and the c’s are non-negative and decrease 
monotonically, the value of each parenthesis in the last 
expression is positive or zero, and not greater than its leading 
term, in one case at, and in the other case ax cp, which in turn 
is not greater than ai li. So the whole expression is the 
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difference of two non-negative quantities, neither of which 
can exceed 7^, and is itself not greater than tt t hi in ab¬ 
solute value. To emphasize by words rather than by subscripts 
what is essential in the conclusion, the absolute rulin' of 



— 1)J dj hp-j+t 


does not exceed the product of the largest of the it's hg the 
largest of tlic h’s 

Let e he an arbitrary positive quantity. Corresponding to 
the continuity of yi, let d-'O be chosen so that | ^(f)— (x\ 
^ for |f — jr- < d. Let the sum —lVAjlij of the 
second paragraph preceding be broken up into two. and 
2", in the first of which j ranges over the values for which 
2 Uj <. d, while the remaining terms make up the second. 
In the largest value of Aj is not greater than A f . while 
Bj never exceeds 1, so that by the conclusion of the last para¬ 
graph, with suitable adaptation of the subscripts. 

In ]£", A] <L\V throughout, while Bj < 1 /[(2 n +1) sin A d], 
consequently 

1.2" I ^ T T /[2(2«+l)sinyd], 


which is less than as soon as n is sufficiently large. 
Hence or, in the earlier notation, the sum from 

jR + 1 to B-\-n, multiplied by 1/(2w + 1), approaches zero 
as n becomes infinite. The sum from E — n to E — 1 and 
the corresponding sums with in place of q>i can be treated 
in the same way, to show that S n (x) — fix) converges 
toward zero. 

If /(a;) is continuous everywhere, the uniform continuity 
of g»i and sps, bearing on the choice of d in the preceding 
paragraph, and, as a detail, on the convergence toward zero 
of the single term in each sum for which r — B, yields at 
once uniform convergence of S„ (x) toward fix). 

Finally, a precisely similar argument can be carried through 
for the case of an even number of interpolating points, 
m — 2n. 
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The conclusion is 

Theorem 3H. If fix) is a function of period 2 it having 
limited variation over a period, S n (x) convei'ges toward fix) 
at every point where fix) is continuous , when m becomes in¬ 
finite through odd or even values, or both, if fix), still sup- 
posed to be of limited variation, is continuous everywhere, the 
convergence is uniform for all values of x. 

This result can be generalized immediately by reference 
to Lemma EH, and the more elementary fact that convergence 
at any point depends only on the behavior of the function in the 
neighborhood of the point: 

Corollary. If fix) is a function of period 2 it, bounded 
and integrdble m the sense of Hiemann over a period, of 
limited variation for Xq — y x f Xo-fy, and continuous 
for x — x 0 , S n (xo) converges toward fix 0 ); if fix) is con¬ 
tinuous and of limited variation foi' a — y < x ft f-V, 
the hypothesis remaining otherwise unchanged, iS n (x) converges 
toivard fix) uniformly for u x < ft. 

5. Degree of convergence under hypotheses 
involving limited variation 

Let fix) once more be of limited variation over a period, 
with total variation V. Let cpiix) be the positive variation 
of fix) from 0 to a? when x>0, and minus the positive 
variation from a: to 0 when xCO, and let sp a (x) be the 
correspondingly defined negative variation function, so that 

fix) — /(0) + SPi(«) — 9>a(a0, <Ti(0) = y s (0) = 0. 

Let m = 2w-fl, n> 0. Consider the sum ^fiU) cos ntr, 
extended specifically from r = —n to r = n. The sum 
2fi 0) cos nU vanishes. In (fr) cos ntr, the term corre¬ 
sponding to r = 0 vanishes with (0). It has already been 
observed that cos»k = (— l) r cos [rnl(2n-\-l)]. In the 
sum 

71 

Zft (4) COS ntr, 
r=i 
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the factor spi(A) increases monotonically, having the upper 
bound <jpi (tt), while, as r«/(2» + l) remains within the limits 
(O, W2 ), | cosw^I decreases monotonically, having the upper 
bound 1. Consequently, by the reasoning of an earlier 
paragraph, 

H 

£ 9i (f») COS n tr ^ ?'1 (zr). 

I— X 

Similarly, the absolute value of the sum from —n to —1 
does not exceed | g>i (— n)\. So the whole sum from — n 
to -\-n does not exceed 

Ti 00 + I SPi C— ») I = 5Pi 00 — sr>x (— n) — 4- r. 

£ 

After analogous reasoning with y s in place of g<i , it is con¬ 
cluded that 

I .S/Or) C0S%#r| <j T. 

There is a corresponding inequality 

12/00 sin w/,| <g V, 

the proof being simplified in this case by the fact that the 
monotone sequences |g>i(ir)|, | <jp 2 00 |, |sinwfr| vary in the 
same sense. 

For m — 2n a still simpler calculation, based on the fact 
that cos nt r = (—l) r , shows that 2/00 cos nt r \ < T, while 
2/00 sin wk = 0. 

Disregarding the special simplicity of the last observation, 
it is possible to state comprehensively 

Le m m a IV. Iff(x) is a function of period 2 n, with limited 
variation over a period, its total variation being T, then 

12/00 COS ntr\ V, |2/00 Sinwfrl T, 

whether m is odd (m = 2w + l) or even (m = 2n). 

Let f(x) be of limited variation over a period, with total 
variation V, and identically zero for a — n ^ x ^ £+ *?• 
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It has already been pointed out in substance that if e m is 
a symbol denoting 1 or 0 according as m is odd or even, 
(x) satisfies the identity 

m S H O) = cos n cot — (t, — x) sin n t, 

— sin n fit ,,) cot — ( t ,— x ) cos nf, 

-j-e OT cos nx£f(t,) cosnt, + sin nx^fiU) sin nt r . 

For any value of x in (a, ft), the product fit) cot \(t — x) 
is of limited variation, regarded as a function of t, and its 
total variation does not exceed the product of V by a quantity 
depending only on 7 . So the application of Lemma IV to 
each of the four sums in turn leads to the 

Corollary. If fix) is a function of period 2n until limited 
variation, the total variation over a period being V, and if 
fix) vanishes identically for a — V ^ x <^ ftthen 

10 * 0)1 < (\Ytn 

for a < x <: /3, where Cy depends only on g. 

This Corollary in turn may be associated with Corollary II 
of Theorem I, in the light of the discussion leading up to 
Theorem Va in Chapter II, to yield 

Theorem IV. If the function fix), of period 2 n, is con¬ 
tinuous with modulus of continuity a> (d) foi a —17 <C| as ^ ft-fg, 
where ai&)>0 for d> 0 , and of limited variation over the 
rest of a period, then 

I fix) — S n 0)| fg c or (2 7t/n) log n 

for a x ^ ft, if n is large enough so that &>(2 nhx) has 
a meaning, c being a constant which depends neither on x 
nor on n. 

Lemma IV is analogous to Theorem III of Chapter II, the 
sums in the present lemma, multiplied by a quantity of the 
order of 1 In to represent the length of the interval between 
successive points U, corresponding to the integrals in the 
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earlier formulation, though as the length of the interval is 
either exactly or approximately i r/w. according as m is even 
or odd, the agreement does not extend to the numerical values 
obtained for the constants in the right-hand members of the 
inequalities. It is sufficiently noteworthy, however, that the 
agreement should be as close at it is, since for any particular 
value of n the number of points used is not large enough 
to make it all evident a priori that the sums give close 
approximations to the values of the integrals, in view of the 
presence of the factors cos nx, sin nx under the sign of 
integration. 

The analogy can be extended to the conclusions of Cor¬ 
ollary I of the theorem referred to. The proof is perhaps 
most readily given by means of the Corollary itself. Siqqwee 
fix) has a pth derivative with limited variation. Then fix) 
is represented by a convergent Fourier series; if the Fourier 
coefficients are denoted by u k , fa, 


+ 2 («fc cos lxfa sin 7t ./•)• 

fc=i 


The series may be used to represent f(t r ) m evaluating the 
sums cos nt r , 2f(tr) sin nt r . Let m = 2«-f 1. The 

expression cos 1 t r cos nt r vanishes unless k — n or Je + n 
is an integral multiple of 2 n +1, that is, unless k has one 
of the values n, w+1, 3«+l, Sm+2 , 5?H~2, 5w+3, 
The value of the sum in each of these cases is (2 n-\-1 )/2. 
The sum £ sinktr cos nt r is zero in all cases. Consequently 

2 fOr) COS lltr — [(2%+l)/2] («w+aft+lH-ato+l+ a 3rt+2+ • * •). 

Similarly, 

2f<*r) sin nt r = [(2«-t-l)/2] (fa—fa+i+ A»+r— A»+a+ • • 
But by the Corollary cited, 

| « fe i ^ V/iTtk^ 1 ), fa\<rl (7ikr+i), 


if V is the total variation of f lp) (x). So 
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1 «»+ a 3n-{-l J r «< 5»+2 • • • | 

„ f r i i . 1 _l i . 

= 7T [ WP+ 1 "*■ (n+l)^ 1 _t ’ (3 W+1)* +1 _r (3 n+2)P+ 1 ‘ r "\ 

^ V[ 1 , 1 , 1 , 1 _L_ 

- n ^ +1 tew )®* 1 r ( Sw )^ 1 n 


27 

jr 


r,. 1 -i 

L 1 . .1 

SP+1 : 

h 5 P+ 1 + J 


the series in the last pair of brackets being convergent for 
p 2> 1, and having incidentally, for p^> l, the upper bound 

1 + 1/3 S +1/5 8 H-, independent of p. A similar inequality 

holds for the series of fs • If m = 2 n. 


JEf(t r ) cos ntr= 2 n («,*-)- a 8 »-f- «5» 


and .2/*(&■) sin nt, — 0. Whether m is odd or even , there¬ 
fore, 

1 2fiU) cos « u | ^ CVtnP, 1 2 f(tr) Sin nU \ ^ CYln ?, 

where C is an absolute constant. 

This result, supplemented by a type of argument which 
has been used a number of times already, gives at once 
certain information with regard to the degree of convergence 
of interpolating sums S n (x). hor example, if fix) has a first 
derivative which is of limited variation over a period, and 
if fix) vanishes identically for a — ij ^ x ^ then 

|/S»G») | can not exceed a constant multiple of 1 /n* for « ^ a? <1 £; 
if fix) has a first derivative which is of limited variation 
over a period, and if fix) is continuous with modulus of 
continuity «(<J) for a — a? ^/S-}-*?, but not identically 
constant over the interval, then | fix) — S n ix) | can not exceed 
a constant multiple of ( 1 /n) ®(27r/n) logn for a x 0. 
But such observations are of secondary interest; and it is not 
possible to pass over immediately to a proposition analogous 
to Theorem IV of Chapter H, for the transition from a specified 
interpolating sum 8 n ix) to one of higher order does not con¬ 
sist merely in the inclusion of additional terms directly subject 
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to the inequalities that have been under discussion. For the 
problem thus suggested a different procedure is required. The 
detailed treatment -will be limited to the next case in order, 
that of a function having a first derivative of limited variation 
over a period (or expressible as the integral of a function 
of limited variation), and not otherwise restricted. 

The key to the discussion of this case is the fact that it 
fix) is the integral of a function of limited variation over 
an interval a <, x <L b, the quotient [fix )—/(«)]/(.?— a) 
(defined in any way, by the value f {a +) or otherwise, for 
x = a) is likewise of limited variation over the interval. Let 

/(a?) = /(«) + J a 9 (x)dx, 

where y(x) — 91 ix )— (x) , and yi and y- are bounded 

non-decreasing functions from a to 6. Let 



for a<.x<^b, while ©i(a) = <n.(a), and let a function © a (,0 
be similarly defined in terms of <jp a (a:-). Then, for x a. 

m=m . = a>.fa)-a>,fx). 

CU Cl 

The assertion with regard to the quotient on the left will 
be proved if it is shown that ©t (x) and © a (:c) are non¬ 
decreasing, the fact is rather obvious, and in formulating 
a proof it is clearly sufficient to consider one of the two 
functions. For any value of x>a, 

ix — a) spi (a) < r $Pi {x) dx if. ix — a) 5P1 (;r). 


by reason of the monotone character of jPi , so that 
Ofx) ^ 9 i ia) = ©i(a). Let x t and x s be any two values 
such that a<.x 1 <Cx i ^ b. Then 



g>! (x) dx 



(ji — a) Xi , 

ixi rt) -i-i » 
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where 

(fi (a) < A\ <1 (ft <= A' a is 9t 
So 

I 9 i (<r) (lx (a"i — (i) At -f- (.r a — ./‘ 1 ) At = : (f) At * 

w a 

whence </>iU’s) ^ A\; as A\ — ©1 (^). the desired conclusion 
is established. Similarly. lfix)—fib)]/(b — x) is of limited 
variation. 

Xow let fix) be of period 2/r, and (for simplicity of 
statement) provided with a first derivative of limited variation 
over a period. The function (x — a) cot ^ (x — a), defined 
so as to be continuous for x = a, has a continuous derivative 
for a <Lx a-\-rc, and so is of limited valuation, over this 
interval. Hence the product 

{[/(a")—/(«)]/(*—«)} j(*—a)cot * (« — «)} 

= [fir) —/(<?)] cot \ (x—a) 

is of limited variation over (a, a -j- n), since tins is true of each,of 
the expressions in braces. Similarly, (x — a —2 it) cot ^ {x — a) 
is of limited variationfora+7r^a:^«+27r; as/(a-f- 2 ri)~f(a), 
the expression [fix )—/(a)]/(a-f 2 it — x) is the same as 
[/(ar) — /(a+2 re)] / (a + 2 tc — x), which is of limited variation 
over (a+ar, a~\-2rt); and so [fix )—/(«)] cot \ (x — a) is of 
limited variation over this interval also. In summary, and in 
slightly differentnotation, the expression [fit) - -fix)] cot J it- x), 
regarded as a function of t, is of limited variation over any 
interval of length 2n. 

Let the interpolating sum for fix) be expressed once more 
in the form used in connection with the Corollary of Lemma IV. 
The interpolating sum for a constant reproduces the constant 
identically; since fix) is a constant with respect to the index 
of summation r, the quantity given by the formula reduces 
identically to mfix), if fit,) is replaced by fix) under ,the 
sign of summation, and 
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:)—/(r)] 

= cos n x2 LAM—/U-)] cot l (t, — x) sin n t, 

— sin nx 2 [/(*>) —/(•*)] cot (t, —x) cos n t, 

+ e m cos nx2 L/X^i) —/(•*•)] cos n t, 

+ e m sin n x 2 LAt) —/(«*)] sin n t ,. 

But f(t)—f(r), as well as [/(<)— ■/(/)] cot 4 (t—x). is of 
limited valuation with respect to t, and consequently Lemma IV 
is applicable to each of the sums on the right. Closer 
inspection shows that the total variation in each case is not 
greater than a constant multiple ot the total variation of 
f'(x ) over a period. If this fact is incorporated in the 
statement of the result, the conclusion may be formulated as 
Theorem V. If f(x) is a function of period 2 tc having a,first 
derivative, icith limited, variation, the total variation of f (or) 
over a pei wd being V, then 

fix) — S n (x)i CV/n, 

ivheie C is an absolute constant. 

The corresponding analysis for the case of functions having 
higher derivatives will not be earned through here. It may 
be pointed out, however, as a first stage in the extension, 
that if /(as) has a continuous non-decreasing second derivative 
for a <=x <L b, then [f(x )—/(a)]/ (x —a), considered to have 
the value f'(a) for x = a, has a non-decreasing first derivative 
over the same interval. For x = a, 


d [/<£)—/(«)_' 
dx L x — a 


2 /"«> 


as may be seen by calculating the derivative from first 
principles and applying the extended mean value theorem in 
the process. For x> a, 

/(*)—/(«)! = (x-a)f(x) — [f(x)—f(a)] 
x—a J (x —a) 8 

= y/"(S) 2 


d 

dx 


J42 the theory of approximation 

the number £ lying between a and x. For x> a, furthermore. 


Jp_ 


ix—aff"{x)—2(x—a)f (x) -\-2[fia.)—f{a)] 

ft X s 

x — a m 

~ ix-af 


= > o 

x — a — 


Tlie monotone character of the first derivative thus becomes 
apparent. It follows that if fix) has a continuous second 
derivative of limited variation, [fix) —/(«)]/(&• — a) has 
a first derivative of limited variation, and connection can be 
made with the facts previously ascertained as to functions 
satisfying the latter condition. 


6. Formula of interpolation analogous 
to the Fej6r mean 

A considerable part of Chapter II was devoted to a dis¬ 
cussion of the arithmetic mean of the partial sums of the 
Fourier series. There is a corresponding formula in the case 
of interpolation, possessing many analogous properties, with 
the outstanding exception that it is not an arithmetic mean 
of a sequence of the interpolating sums previously studied. 
It is to be defined and examined on its own merits, with 
only incidental reference to the content of the earlier part of 
the present chapter. 

Let n be an arbitrary positive integer, and let U = 2rn/n, 
for any integral value of r. In comparison with the earlier 
notation, m is now to be taken equal to n, instead of 2n -\-1 
or 2n\ the sign £ will be understood to refer to summation 
over n successive values of the index r, when there is no 
indication to the contrary; a separate symbol m is no longer 
needed; and there is no occasion to distinguish between odd 
and even values of n. 

Let fix) be an arbitrary function of period 2 it. The inter¬ 
polating formula in question is 


°ft ( X) jtsfifr) 


sin 8 $%(4 — x) 
n s sin*^(k — x) 
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with the understanding that each term is defined so as to be 
continuous wherever its denominator vanishes. For ./• = f, } , 
one tenn of the sum reduces to f(t q ), namely that in which 
> = q, or in, which r differs from q by an integral multiple of n, 
while each of the other terms becomes zero, and consequently 

a n (t Q ) = f(t s ). 

So the interpolating property is apparent at the outset. 

It is an almost immediate consequence of identities previously 
employed that 

(sin* (sii’i-r) 

— 2 n“{-(n —l)cosu+(»—2)cos2r+-"+cos(»— l)pl. 

Hence, as a result of the substitution v — t, — x, 

(x) — | ffo ~h«i coscc-f- a s cos2aH-(-«n—iCOs(w—l).r 

+ hi sinsin 2x-\ -1-b n —i sin(« — l)x. 

with the coefficients 

aj „= 2 ^” n 8 — Zfitr) cos TiU, 
bk= 2 - ( \7 ~ sin ktr> 

The expression is a trigonometric sum, but of order about 
twice as high as would be required merely for the purpose 
of obtaining coincidence at n points. 

If /(*) =: 1, all the coefficients reduce to zero except cio, 
while = 1 , so that in this case <r n (.x) ==-- 1 , or, if 
(sin® mi)/ (sin 2 w) is denoted by d) n (ti), 


2 


sin® \njfr —a?) 
n a sin 2 4 (t r — x) 




1 . 


Hence it follows further, inasmuch as ®»(w) is never negative, 
that if fix) is any function having M as an upper bound 
for its absolute value, 
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«•»(.'•) = , 2 (1 /«*)/(&) ®« [ 2 — *)] | 

^ 2? a/ n s ) 0>« [ g (t, —.*■)] = M. 

Like the arithmetic mean associated with the Fourier series, 
the present <r n (x) converges uniformly toward /(a?), as 
)i becomes infinite, if f(x) is everywhere continuous, and 
converges at points of continuity under more general hypotheses 
as to the behavior of the function elsewhere. Inasmuch as 
the sums depend on the values of the function at isolated 
points, however, it is necessary to impose some restriction 
on the values which it may take on point by point, not 
merely to require that it be summable, or in other words 
that it have a finite mean value. 

Let f(t) be continuous for t = x, and let | fit) <L M 
everywhere. By a device already used on a number of 
occasions, the error of <r n (&) can be expressed in the form 

o'* O) —fix) =2(1/n s ) [f(U) —f(x)] <D n [ g (t, —as)]. 

Let * be an arbitrary positive quantity, and let & be a positive 
number such that |/(f)— f(x) \ < for 1 1 — x\ < d. Let the 
sum in the right-hand member of the identity for <r n (x) — f(x) 
be written in the form JS'-f 2', where 2 denotes summation 
over those values of r for which U — x differs from an integral 
multiple of 2n by less than d, and 2" stands for a summation 
covering the remaining values of r. In 2 > ^fif )— 
so that 

2'^/n s )[f(U)-f(x)]d>n\\ (t,-x)] 

^ 2'(l/n a ) (e/2)<Z>»£-~ (U— a:)J 

<2(l/n s )(s/2)0 n [!-(#,-*)] = *. 

In 2", ®)1 ^ l/(sin 8 £d), and \f(U)—f(x)\ £ 2 M, 

while the number of terms can not exceed n, and consequently 
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2T(1 /«■) [/(*»)—/M] <*>» [ 2'(6—./)] < 2JT (m sin 2 —- <J) , 

which is less than 4 * as soon as n is sufficiently large. This 
establishes the tact of convergence. It is sufficient that n 
surpass a bound depending only on 31 and d: it fix) is con¬ 
tinuous everywhere. <5 can be chosen independently of x, 
and the convergence is uniform. If fir) is continuous for 
« — v ^ x ^ ft-\- v - without being necessaiily continuous 
everywhere, it is possible to choose a i<5 which shall be 
valid for all values of or in the interval « <- ,r ft, and 
the convergence is uniform over the latter interval. Finally, 
if fix) , instead of being merely continuous for a — 

ft is identically zero there, and if x is given a value 
belonging to the interval («, ft), <Z>« [4 (t r — ./■)] <: l/(sin 2 4iy) 
in all terms in which f(t,) 4" 0, and 

c n (x) 1 < 31 /(n sin 2 4 f). 

The results may be summarized in 

Theorem VI. If fix) is a bounded function of period 2 -t. 
o» (as) converges toward fix) at every point at which fix) is 
continuous. If fix) is continuous every ichei e, the convergence 
is uniform everyudiere. If fix) is continuous for «—// x 
ft-fy, the convergence is uniform foi a < x ^ ft. If 
fix) is identically zero for « — ij <| x <[ ft + n , <s n ix) 
< Cy 31 In for ce ^ x ft, nhere 31 is an upper hound for 
\fix)\, and Cy is a constant depending only on //. 

Further discussion of the degree of convergence of ix) 
will be limited to the case in which fix) satisfies the condition 

\f(Xs)—f(x i). h\Xt Xi . 

Suppose first that this condition is satisfied everywhere. Then 

Wnix)— fix)\ <, iW)£ I^ — rrl <Z>„[-— (^- — ar*>]. 

For any particular value of x, let ts he that one of the 
numbers U which is nearest to x, or one of the two nearest, 
if x is equally distant from two of them, so that ts — (**/») 
x ts -f- in In) . Let the summation be thought of as 

10 
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extended specifically over the n values ot i for which x — -t 
< t, <.r-f-5T. The precise expressions for the extreme values 
of /• in terns of It and n will vary according to circumstances, 
they will he approximately 2 ? ± J «, and it is sufficient for 
the purposes of the present argument to note that they will 
certainly be between It — n and 2? -f- n , for any n ;> 1 . 

For any value of r, 

2^" u+<w—l)cosr+t«—2)cos2/4-+cost«—1)<J 

ii + (it — 1) + (n — 2j + • • • +1 jj = M s , 

and lienee 

Furthermore, 0 rj sin 4 c ^ rr for v <. rr. Throughout 

this interval, consequently 

_ /1 \ v siniwr .1 ; ^ . 

\ 2 / sra£r sm^i> 2 , — 

and at the same time (for r 4- 0) 


o n (\ v •) = A_ . 

\ 2 / , v | sin 2 v 


sin 2 1-Jir < 

2 — v 


These relations are to be used m connection with the 


inequality for : <r n (a?)— f(x) ' in the second paragraph preceding. 
If ,*■ is one of the numbers t r , o» (pr) =/(*), and there is 
no further question as to the magnitude of the error. If :>• 
does not coincide with a U, there are just two values of r 
in the summation for which f, — x\<C 2rein. In the terms 
of the sum corresponding to these values of r it is sufficient 
to apply the relation |c| <&,,{% v) < nrr, or \ t r —x [£(2 r — ?,j] 

utc. For the other points t,, in the order of increasing 
distance from or on each side, the values of j t, — x\ are 
successively greater than the numbers 2 -itIn , 4 n/n, Qrt/u, 
and by the relation j v\ (i v) < n a /\v the corresponding 
values of |f,— x\ — a-)] are respectively less than 

n nr/2, ni r/4, ni r/6, •••. Hence, as the sum involves not 
more than n points U on each side of x, 
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2 , t, - r if -A‘j| 

^ 2nrr-\-2 [?i nj2 + n tt/ 4 + « ;r/6 +-r a -t, (2 «)] 

= «>r[s+l+ | +-L + ...+ *] 

<s U7T J^B+ = n -t (3 + log «)• 

which lor n 2 does not exceed a constant multiple of 
u log n . This makes it possible to state 
Theorem VTX. If f{x) is a function of period 2 .t satisfying 
everywhere the condition 

/G*'*) —/Oi) < X .r± — x t , 
then, for all valves of 2, 

fix )— On O) < (C'l log n)/n, 

where C is an absolute constant. 

The conclusion may be generalized at once by combination 
with the last assertion of Theorem VI, to yield the 

Corollary. If f{sc) is a bounded function of period 2.t 
satisfying the condition 

\ f { Xi ) —/(sCi) < X Xi — X x 

throughout the mtei'val a — ^ < jr; < • then 

•fix) — o n {x)\ < (cX log n)/n 

for a ^ ar* <£. ft, where c is a constant depending neither on x 
nor on n. 

In conclusion, it may be pointed out as a peculiarity of 
the interpolating sum o n (x) that its derivative vanishes at each 
of the points t a (See L. Fej6r, Gottinger Nachrichten (1916). 
pp. 66-91, especially pp. 87-91). For in the identity 

c»(aj) = 2 (1 /»*) fit*) ®» [ | if — a-)J 

it is evident from the expression of O n in fractional form, 
or can be verified by differentiating this expression, that 

10 * 
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<t> n \k{tr — .r)] has a double root for x — t a if t, is not con¬ 
gruent to £* modulo 2n, -while the representation of ©«[£(4—#1] 
as a sum of cosines shows that it also has a vanishing 
derivative for x = t q . 

7. Polynomial interpolation 
As was indicated in the opening paragraph of the chapter, 
the methods that have been set forth are not adapted to the 
study of the problem of polynomial interpolation with equally 
spaced points- That problem is analogous rather to the theory 
of Taylor’s series, whether treated by means of Taylor’s 
theorem with the remainder for real variables, or by Cauchy's 
theorem in the complex plane. A simple change of variable, 
however, serves to carry over the formulas of trigonometric 
interpolation to a case of polynomial interpolation with 
unequally spaced points distributed in a certain way. Suppose 
namely that a function /(a?) is defined for — 1 x < 1. Then 
/(cos 0) is a function defined for all values of 0. It is an 
even function of 6. and inspection of the formulas defining 
the coefficients in the interpolating sums S n (6) and o»(0) shows 
at once that these sums involve only cosines, and so may 
be regarded as polynomials in cos 0. If x — cos 0, S n (0) 
and a n (0) may be denoted by U) and ti, x {x) respectively. 
They are polynomials agreeing in value with f(x) for a set 
of values of x corresponding to equally spaced values of 0. 
The polynomial n n (x), unlike the interpolating polynomial of 
minimum degree for equally spaced values of x, converges 
in the case of eveiy continuous function /(a;); its degree how¬ 
ever is approximately twice as great as the number of points 
for which coincidence is obtained. The expression P n (x), on 
the other hand, is an interpolating polynomial of minimum 
degree, and while it is not convergent for every continuous 
function, it converges far more generally than the correspond¬ 
ing polynomial with equally spaced points. It is unnecessary 
to enumerate the further theorems on convergence and degree 
of convergence which would be obtained by following out 
the transformation of variable in detail. 



CHAPTER V 

INTRODUCTION TO THE GEOMETRY OF FUNCTION SPACE 

I. The notions of distance and orthogonality 

In Chapter HI, attention was directed to the problem of 
the approximate representation of a given function by means 
of linear combinations of other given functions, according to 
the criterion of least squares. If f (pc) is a given function 
over an interval (a, 6), and if p ± (x), p 2 (x), p m Or) are 
m linearly independent functions over the same interval, the 
coefficients c l9 c m in an expression 

Sp fcO = c i p 1 (x)+ UrtH- * + c m p m (x) 

are to be determined so that 

f a [/(#) — <P dx 

shall be a minimum. The value of the integral is taken as 
a measure of the discrepancy between the functions fix') 
and sp (cc). 

The problem is the same in principle as that of the method 
of least squares for the approximate solution of a set of linear 
equations. Suppose there are n equations in m unknowns, 
n>m * 

fl&il X'l - 4 ~ (I2 1 X$ + • • • -j- dm\ Xfn -&t ? 

Xi -J— dg 2 "1“ * * * ”4” dm2 Xtn — b >, 

Ctui Ct?i -f- Cl'ZnX^ -j - * * * “4“ CCmnXm J —- * 

The set of numbers (&*, —, &») may be regarded as con¬ 

stituting a function 6 (ft) = in which the independent 
variable ft takes on only a finite number of values, the 
integers from 1 to n. In the same way, each of the sets 
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(an, • • •. chn) may be regarded as a function chQc). The a's 
then are the coefficients in a linear combination of these 
functions, 

wic = w (k) = Xi (ii (7r) + X 2 a* (Ji) -f- • • • + oc'ni a n % (/>'), 
and are to be determined so that 

Z (h— u k y 

fc=i 

shall be as small as possible. The sum of squares this time 
measures the discrepancy between the approximation and the 
function approximated. 

The form of the sum suggests a geometric interpretation, 
in which the functions bQc), wQc) are represented by points 
in M-dimensional space, with coordinates (6 1} b s , , l„) and 
(?fi, w n ) respectively, and ]£(bk — «•*,)* is the square 

of the distance between these points. By an extension of 
the same idea, the functions f(x) , sp (#) of the first paragraph 
are thought of as corresponding to points in a space of 
infinitely many dimensions, with J[f(pr) — sp(a:)] 2 dx as the 
square of the distance between them. This definition of 
distance is the beginning of a systematic geometry of func¬ 
tion space. 

Another hint of geometric analogy which is present from 
the beginning consists in the recurring use of the term orthog¬ 
onal, two functions being called orthogonal to each other 
over an interval when the integral of their product over the 
interval is zero. While it is worthy of some emphasis that 
orthogonality of the functions which form the basis of the 
approximation is a convenience rather than a necessity, the 
notion of orthogonality is inseparably associated with the 
least-square condition in another way. 

Let the functions f{x), pi (x), • • -, p m (x) be supposed con¬ 
tinuous, for simplicity of illustration. (It would be sufficient 
that they be integrable together with their squares, provided 
it is assumed that the p’s are properly independent, in the 
sense that every linear combination of them containing a non- 
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vanishing coefficient is different from zero over a net of posit ur 
measure; this condition is of course satisfied automatically if 
they are linearly independent and continuous.) In aider that 
< i, • • •, he the coefficients giving the least-squat e appt o.nmo¬ 
tion, it is necessary and sufficient that f(.r) — y {x) he orthog¬ 
onal to each of the functions p, {x), or in other words it is 
necessary and sufficient that f(x )— <p{x) he orthogonal to 
every linear combination of p x (r), • ■ •, p tn , (x). This can be 
verified algebraically, without a detailed examination of the 
necessary and sufficient conditions for a minimum in the 
calculus. 

Let ip(x) be an arbitrary linear combination of the j>'s 
with at least one non-vanishing coefficient. Then 

ot (x) ~-z gp O) + h if> 0) 

is a linear combination of the j/s, for any value of h. and 
every linear combination different from gp can be written in 
this form by a suitable choice of if>. "Let 

I =£[/&)-<p(.f)Ydx, 

J =1 If {a j)— m(x)Ydx = £ {[/(«) — SH<*)]— hi/i(x)} s dx, 
Jl =1 [/(cr)— gpfcr)] if>{x)dx, S =X [ty (x)]* dxZ> 0 . 

Then J= I — h(2R—hS). If R + 0, h can he chosen so 
as to make J<I (for example by taking h — R/S), contrary 
to the supposition that gp is the minimizing function, while it 
R = 0 , J — I+h' i 8>I for every h 4 0 . 

The conditions §[f(cc) — <p(x)]p t (x) dx = 0 , / = 1 , 2 , •••, m, 
are equivalent to the w linear equations 

W p b f»b 

2* Uj J a pj (x) jh( pc) dx —J a f (*) i>* (*) 

for determining the m c’s. It has been shown that these 
equations must he satisfied if the least-square problem is to 
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be solved, and that the problem will be solved if the equations 
are satisfied. For the moment there is still a question whether 
the equations have a solution. The answer is immediate, 
however; the left-hand members are independent of the func¬ 
tion /(ar), the least-square problem obviously has the unique 
solution t\ = (■* = ••• = r m — 0 in the particular case 
f(jr) = - 0, and consequently the determinant of the coefficients 
must be different from zero. If the p’s were not linearly 
independent, the problem of approximation could still be solved 
in terms of a linearly independent subset of them, and this would 
be at the same time a solution in terms of the original set 
of p's, but the solution in terms of the latter set as a whole 
would not be unique. 

Similar reasoning is applicable to the problem of the second 
paragraph of the chapter. Integrals are to be replaced by 
sums throughout, and in particular the property of orthogonality 
of two sets of numbers is expressed by the vanishing of the 
sum of the products of corresponding numbers of the two 
sets. The hypothesis of linear independence of the p's corre¬ 
sponds to the condition that the matrix of the coefficients <hk 
be of rank m. The condition of orthogonality characterizing 
the least-square solution takes the form that the set of 
residuals 1 >\,— Wk is orthogonal to each of the m sets of number s a,k 


ft 

2 (fife - ic k ) Oik 




0. 


The ith equation of this set may be constructed by multi¬ 
plying each of the given equations by the corresponding co¬ 
efficient of xt, and adding the equations thus obtained. Written 
out at length, the new equations have the form 

ijgS fc rtlfcj “l" Obnkj Xm 

n 

= 2 i — 1 , 2 , • • •. m. 

fc=i 

They are Idle weU-known 11 normal equations ” for the solution 
of the problem of least-square adjustment. 
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In certain cases the necessary condition of orthogonality 
can readily be translated into another familiar form. Consider 
once more the case of functions of a continuous variable x. 
Let fix) and q(x) be given functions for a <1 x <- b, for 
simplicity continuous, and let q (x) be non-negative and not 
identically zero over the interval, and let P n (x) be the poly¬ 
nomial of the nth degree which minimizes the integral 

f a Q( x ) lf( x ) —Pn (•*.)]* dx. 

Then the remainder f (x) —P„(a), if not identically zero nhen>- 
ever q (x) ^ 0 , must change sign at least n + 1 times m the inter¬ 
val (a, 6 ). If [9 (a ;)] 1 / 2 is denoted by q (x), the integral to be 
minimized is the same as J *[q (x)f(x) — q (a:) P n (cc )] 2 tlx, and 
the problem is that of approximating q (x)f(x) by a linear 
combination of the functions q(x), xq (x) ,<• • •, a* q (x ). By 
the general proposition obtained above, it is necessary that 
q («■)/(*)— q(pc)P n (x) be orthogonal to every linear com¬ 
bination of the functions x 1 q (x), or, in other words, orthog¬ 
onal to q (pc) Q n (x), if Q n (x) is an arbitrary polynomial ot 
the nth degree; in symbols, since [2 (a :)] 8 = q (x), 



Q (P) [f(pc) — 


Pn («)] Qn(x) ilx 


0 . 


It f(x) —Pn (x) had not more than n changes of sign, it 
would be possible to construct a Q n (x) having the same 
sign as f(x) —P n (as) at all points where f(x ) —P« (a*) + 0, 
and this would make the integral positive, unless q (pc) — 0 
wherever f(x) — P n (x) 4= 0 . The condition that Pn (a?) be 
such as to give the remainder the requisite number of changes 
of sign is of course not sufficient for the solution of the 
least-square problem, since it is satisfied by any polynomial 
differing sufficiently little from the minimizing polynomial. 
A corresponding formulation is possible in the case of ap¬ 
proximation by trigonometric sums, with or without a weight 
function. 

The least-square problem, in general as well as in the 
special cases discussed in the preceding paragraph, is that 
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of choosing from a given linear family of fmictions the partic¬ 
ular one which is closest to another given function, according 
to the measure of discrepancy specified. The geometric inter¬ 
pretation that has been suggested gains in clearness if each 
function, instead of being represented merely by a point in 
a space of an appropriate number of dimensions, is also 
represented alternatively by the vector from the origin to 
the point in question. The linear family of functions then 
corresponds to a linear spread in the geometric picture, and 
the function of closest approximation corresponds to the point 
ot this spread whose distance from the point representing 
the function to be approximated is as small as possible; the 
residual function/(a) — <p (ji) (or b(k) — u (7.)) can be regarded 
as the vector from <p (x) to f(a •) (or from ic (7) to b (7,)); and 
the property of ‘'orthogonality” characterizing it is associated 
with the fact that the shortest distance from a plane spread 
to a point outside it is perpendicular to the spread. The geo¬ 
metric terminology thus acquires additional plausibility. 

a. The general notion of angle; 
geometric interpretation of coefficients of correlation 

An obvious further step is to proceed from the notion of 
orthogonality to a general notion of angle in function space. 
As will be seen presently, the definition of angle is already 
implicit in that of distance, if the frame of Euclidean geo¬ 
metry is to be fitted consistently to functional relations, 
though the question as to the possibility of carrying the idea 
through systematically and without danger of internal contra¬ 
diction still calls for some elucidation. 

It will be well first to adopt a common notation for dealing 
simultaneously with functions of a continuous variable and 
functions of a discrete subscript. If x (t ), y (t) are continuous 
functions of t for a ^ t < 6, let 

(aj-y) = x(t)y(t) dt. 

If a?i, a? s , • • • , Xn, Vi, Vi, • • • , t/n are two functions of an index /. 
which ranges from 1 to n, let 
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n 

(*•!/) = 2 Xhi/L- 

/l =1 

The quantity (x . y) is the inner or scalar product of the two 
functional vectors. In particular, (x-x) =2>£ or J\rV/. 
etc. The condition for orthogonality of r and y is that 
(x ■ y) — 0. A. third case worthy of explicit mention is that 
of infinite sequences x lf x i} • • y 1} y a , .... such that 
and are convergent. The interpretation then is that 

0 - • y) — 2 i/l • 

Ti,~ 1 

All three are of course isolated particular cases from the 
point of view of general analysis. 

Let x(t), y(t), or x(k), y{k), be two functions represented 
by the points P, Q respectively, and let 0 be the origin, 
corresponding to a function which vanishes identically. By 
the convention already adopted, the sides of the triangle OPQ 
are 

OP* = (x-cr), OQ a =(y.y), 

PQ* = ((y — *) ■ (if — *)) = (y-y) —2(r-y)4-(.r..r). 


If the angle POQ is denoted by 6, application of the law 
of cosines to the triangle gives 

PQ* = OP* + OQ a —2 OP ■ OQ cos 0, 

whence 2 OP • OQ cos 6 = 2(x -y), and 


cos 6 = 


ix • £r) 1/s {y • y) 1/2 ‘ 


When the independent variable is It = 1, 2, 
formula becomes 

2*kyk 


COS 6 = 




n . the 


This is recognized immediately as a fundamental formula in 
the theory of statistics. It 'is the coefficient of correlation 
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between the variables x and y, if these have been reduced 
to the form of deviations from their respective arithmetic 
means, or, in other words, if £xk — 2t/k = 0. Subsequent 
pages will show that the geometry of function space throws 
much light on the structure of more complicated formulas of 
correlation. 

It becomes important to inquire more closely as to the 
logical basis for the identification of analysis and geometry. 
The essentials of such a basis are implicit in the considerations 
leading up to the proof of Theorem III in Chapter HI. To 
repeat in the present notation what is needed for the purpose 
in hand, restricting attention at first to a two-dimensional 
spread, let x, y be any two functions which are linearly in¬ 
dependent over one of the ranges specified in the third para¬ 
graph preceding (or over some other appropriate range); if 
the range is an interval, let it be supposed for convenience 
that the functions are continuous. Let 


£1 = x, 


h = y — 


ix 


&•?/) 


It follows immediately from this definition that (£ x • yi) = 0. 
By the hypothesis of linear independence, furthermore, it is 
certain that % is not identically zero. If 

s = *!/(*!. so*, v = ’IiKvi-vO 112 , 


the functions £, y are orthogonal to each other: (£-y) = 0; 
and they also satisfy the condition that (£.£) — (y>y) — 1. 
They are linear combinations of x and y, and it is seen at 
once that the determinant of the coefficients is different from 
zero, so that x and y conversely can be expressed as linear 
combinations of £ and y. 

The functions x and y being given, let § and y now be 
any pair of functions such that x and y are linearly expressible 
in terms of § and y, and such that 


('S-y,) = o, (£•£) = (v -v) = 1. 
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It has been shown that such functions ?, y can be consti'ucted. 
There are infinitely many pairs satisfying the requirements; 
the original I, // can be replaced by . 

if a i; A. > « 2 , A are any constants such that 

«i«a+>M* = 0. u\l+/% = = l. 

Furthermore, the requirements can be satisfied even if j- and y 
are not linearly independent; it suffices then to express x, y 
in terms of two linearly independent functions Ji, y x , and to 
construct functions ?, v\ as above with j\ and y t in place 
of x and y. 

In terms of a chosen pair of normalized orthogonal 
functions ?, •//, let 

x — «i £ + M> y — a»$ + h*l‘ 

Then 

(x• x) = a{-\-1\, (y-y) — a\+b%, 

(0/ —*) • (y—x)) = (« 2 — fl 1 ) 2 +(6 2 — b t y. 

If the functions x,y aie thought of as corresponding to the 
points P, Q, with the coordinates (a l} bt) and (a*, b a ) respectively 
m a rectangular coordinate system, while O is the origin, the 
quantities (x-x), (y-y), and ((y — x)- (y —x)) are the squares 
of the distances OP, OQ, and PQ. The cosine oj the 
angle POQ is 

0 x ^2 +Ms _O • y) _ 

(a? + &?) 1/2 («I + Vi) 1 * (x • x) 11 ' 2 (y . y) 1 ' 2 ’ 

More generally, if 

u = Zi x H - yi y = At 2r-j-I?i/;, v — h 3 x+yay = + 

tire any linear combinations of x and y, and if JR, S are 
the corresponding points (A? Bt), (As,B s ), then («•?(), (r * v ), 
and ((r— u) - (v — u)) are the squares of the distances OP, OS, 
and PS, and the quantity («•») is the scalar product of the 
vectors OB, OS. Every linear combination Xx+PV corre¬ 
sponds to a definite point with coordinates (I a s , +/*&*) i 
if x and y are linearly independent, a one-to-one correspondence 



158 


THE THEORY OP APPROXIMATION 


is established between functions linearly dependent on them 
and the points of a plane, in such a way that there is an 
actual identity between the measures of distance and angle 
and the quantities which were associated with them originally 
by analogy. So far, moreover, there is no need of any space 
of more than two dimensions. 

The correspondence being once established, the analytical 
relations expressing geometric facts are implied with logical 
conclusiveness by the geometric facts themselves. Consider 
for example the problem of determining a constant X to mini¬ 
mize ((y — X,r) • {y — Xx)), when x and y are given. If sc 
and y are interpreted as statistical variables, representing 
deviations from the respective arithmetic means, X is the 
coefficient of regression of y on ,r. The points corresponding 
to x and y being P and Q, as before, the function Xj is 
represented by a point M on the line OP. and y — Xx corre¬ 
sponds to the vector MQ, in the definite sense that when 
y — Xx is expressed as a linear combination of I and >/. 
the coefficients of ? and >/ are the components of the vector. 
It is clear from the geometric figure not only that MQ must 
he perpendicular to OP, or in other words that y — Xx must 
be orthogonal to x, but also that the value of X which 
accomplishes the purpose is given by 


X 


X . OP — OM = OQ cos 6 , 

°Q . „ = (y • y ) v ' 2 («-y) _ (a?«y) 

OP cos v (£C . x y ,. 2 • . x y,i (y . y yja — . x y 


where 6 is the angle POQ, and furthermore that the minimum 
value of ((y — Xx) • (y — Xx)) is 


J/Q* = (OQsin0) a = (y.y)(l— cos 8 ©) = (y . y) (1 — >•*), 


if cos 6, interpreted as a coefficient of correlation, is denoted 
by r. 

For the geometry of three (linearly independent) functions 
x, y, z, let £i, Vi, n be defined as before, and let 


Ci 




(£i • g) 
«!•&) 


n\ 


Cgi: g) 

(Vi’Vi) ’ 


C = £i/(£i-£0 1/2 - 
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Then £,//,£ satisfy the conditions 

(£•'/) = Gf-9 = («•£) = 0. (?•£) = (>/•//) 1. 

and x,y, z are linearly expressible in terns of them. There 
are infinitely many sets of functions satisfying these same 
conditions, and serving equally well for the representation 
of x, y, z, any such set being expressible in terms of the 
particular set by means of the coefficients of an orthog¬ 

onal linear transformation in three variables. Any linear 
combination u = lx-{-yy-\-rz can be expressed in the form 
A2jF-f- Brj + C£, and can thus be put in correspondence with 
a point (A, B, C), the square of whose distance from the 
origin is the quantity («• u) — A 8 -f P a 4-0 2 - If n and i 
are two such linear combinations, corresponding to the points P 
and Q, the cosine of the angle POQ is (tt • ?)/[(.« * n) (r • r)] 1 
The representation of functions by points or vectors is 
particularly convenient for the visualization of coefficients 
of partial aud double correlation. Let x. y. z be three given 
functions, corresponding to points P, Q, K in three-dimensional 
space. If x, y, z are sets of deviations from arithmetic means, 
so that the statistical terminology is appropriate, the co¬ 
efficient of partial correlation between x and y, when z is 
held fast, is the coefficient of correlation between x — and 
y — pz, where I and y are the regression coefficients of x 
on z and of y on & respectively. The function x — Iz is 
a linear combination of x and z. orthogonal to z; its geo¬ 
metric counterpart is a vector in the plane POP, perpendicular 
to OK. Similarly, y — yz is represented by a vector per¬ 
pendicular to OB, and lying in the plane QOK. The angle 
between these vectors measures the dihedral angle formed 
by the two planes. So the coefficient of partial correlation 
in question is the cosine of the dihedral angle. Let Pi, Qi . 
Pi be the points in which the rays OP, OQ, OK pierce the 
unit sphere about the origin as center. In the spherical triangle 
Fi.QiKi, let a, j8, y be the measures of the angles Pi. Q u 
Pi respectively, and let a, h, a be the sides opposite these 
angles. Let r ls , r 18 , r S8 respectively be the ordinary cor- 
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relation coefficients of .r and y. x and z, and y and z; let 
i’i 2 # be the partial correlation coefficient which has just been 
discussed, and let >*i 8 *> and >'331 be the other coefficients of 
partial correlation. It has been seen that rn s = cos y. In 
the same way, r u 2 — cos 0 , r 2S 1 — cos a , while 1 1S == cos c, 
= cos b, r i9 = cos a. By the law of cosines. 


cos y = 


cos r — cos a cos b 
sin a sin b 


which means that 


_ _ >12 - 18 > 28 _ 

This is the standard formula expressing a coefficient of partial 
correlation in ten-ms of ordinary correlation coefficients. The co¬ 
efficients >'i. t 2 , ras 1 of course hare corresponding expressions. 
The inverse formulas 

_ _ ?‘12 8 + >*18 2 >*28 1 


etc., are similarly obtained from the polar triangle. 

For the definition of one of the coefficients of double cor¬ 
relation, parameters A, p are to be determined so as to mini¬ 
mize the quantity ((# — Xy — y>£) - (a* — Xy — yz)), or in 
other words to give the least-square approximation of x by 
means of a linear combination of y and z. This requires that 
x—Xy — /t 2 be orthogonal to both y and z; geometrically, 
the point M representing the combination Xy + /ig is the foot 
of the perpendicular from P on the plane QOR. Then n ^, 
the coefficient of double correlation between x and the pair 
of variables y and z, is the coefficient of simple correlation 
between x and Xy-\-pz, the cosine of the angle MOP between 
OP and the plane QOR. Let this angle be denoted by h. It 
is measured by the arc P t Si, if S t is the foot of the altitude 
from Pi to the side Qi Ri in the spherical triangle. Hence h 
may be calculated as a side of the right triangle RxSiPi, 
by the formula 

sin 7i = sin b sin y. 
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By substitution from the relations 


sin* 6 — 1 — 

sinV = 1 — >* 23 
it is found that 


1 — _L 9 »• *» i» 

x ; 12 ' 13 7 1>3 ~ ~ 7 12 'l$ 7 

(i-^) (1-4,) 


z 1 ag = cos 7/ = (1 — sin* hY* 


(4 + 4; 


13 


■2r u /* u i. 


M2 


(i — 4 ,) 1 ' 2 


There are corresponding formulas for the other double cor¬ 
relation cofficients r* 13 and r». 12 - 

The same figure may be used to obtain formulas for the 
coefficients X and f*>, the partial regression coefficients of 
x on y and z. Let lines be drawn through M parallel to 
OR and OQ, meeting OQ and OR at K and L respectively, 
to form a parallelogram OKML. The vectors OK and OL, 
constituting a resolution of OM into components collinear 
with OQ and OR, represent separately the terms Xy and (*z. 
On the surface of the sphere, let the arcs Rt Si and S\ Qi be 
denoted by d and e, so that d measures the angle LOM, 
while e measures the angle MOK and its equal OM L. (The 
formulation is adapted throughout to the case in which the 
point M falls within the angle QOR, so that 81 is interior 
to the arc QiRi.) In the plane triangle LOM. having two 
of its angles equal to d and e respectively, the third angle, 
at L, is the supplement of <7 + e. But the arcs <1 and e on 
the sphere make up the side a of the original spheiical triangle. 
So the law of sines in the plane triangle gives 


OL _ sin e _ sin e 

OM sin (d + e) ~ sin a ’ 


In the right spherical triangle Pi S L Qi, on the other hand, 


so that 


sine = 


sin c cos ft 
cos h 


OL 


0 M sin c cos 
sin a cos h ’ 


11 
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while OL — (i-OB, and OM= OP cos h. Hence 


OL _ 
11 OR 


OP sin c 
OR sin a 


COS/5, 


or with the use of a conventional notation, presently to be 
explained at greater length, for regression coefficients and 
standard errors of estimate, as well as that already used for 
coefficients of correlation, 


Similarly, 

&12 a 


7 °i 2 

Oxs.2 = —- Ha 2 * 

«8 2 


OPsin b 
OQsma 


cos y — 


Oj 8 
°2 8 


n 2 .)• 


The results of the preceding paragraph can he obtained by 
a possibly less straightforward but more exclusively geometrical 
method, making no use of the spherical triangle or of spherical 
trigonometry as such. Let the letters O, P, Q, R, M, K, L 
have the same signification as before. The letters a, b, r, 
«, ft, y retain their previous meanings as measures of the 
face angles and dihedral angles of the trihedral angle O-PQR. 
Let H be the foot of the perpendicular from M on O Q, and 
J the foot of the perpendicular from R on OQ. The triangles 
KMH and ORJ in the plane QOR are similar, since KM 
is parallel to OR (having been so constructed), MS is parallel 
to RJ (both being perpendicular to OQ), and KS and OJ 
are collinear. Furthermore, KM and OL are opposite sides 
of a parallelogram. So 

OL KM ME 

OR OR RJ' 


But the length of RJ, the perpendicular from R on OQ, is 
OP sin a. Also, PH is perpendicular to OQ, since MP, being 
perpendicular to the plane QOR, is perpendicular to the 
line OQ, and OQ, being perpendicular to MP and ME, is 
perpendicular to their plane and to the line PH in that plane; 
hence MHP is the measure of the dihedral angle ft, so that 
MH — HP cos /S (as PMH is a right angle), while HP 
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— OP sine, making MH — OPsin e cos ft. The resulting 
expression for ft is the same as before. 

The preceding calculations are not restricted in substance 
to the case of statistical variables, but can be formulated 
without the terminology of correlation, and are then applicable 
to any functions x, y, z coming under the original hypotheses. 
It is to be emphasized also that for the time being no use 
has been made of space of more than three dimensions: the 
geometry is the actual geometry of experience. 

3. Coefficients of correlation 
in an arbitrary number of variables 

For dealing with relations of higher complexity it will be 
convenient to modify the notation somewhat. When there 
are m functions to be considered, let them be denoted by 
X!, xs, • • •, x m - The case of primary interest will be that of 
statistical variables, measured from an arithmetic mean in each 
ease; i. e., xg will stand for a set of numbers xn , x&, . • - . x m , 
subject to the condition that x t i + ate-f •••+%»== 0. Apart 
from technical notation and terminology, however, the work 
will be valid for functions of any of the types previously 
considered. As an additional item of notation, (r-.r) will 
be abbreviated to ((*))*. 

The reduction of general coefficients of correlation and 
coefficients of regression to expressions in terms of coefficients 
of lower order depends on the following fundamental prop¬ 
osition: 

Let be determined so as to minimize 

— 1* X% — • • • — Am iCm)) 8 , 

and voith these values of the A’s, let 

03 S. X\ — CPg — • • • Am • 

Let (t t , . • •, ft m and v S) • *., v m be determined so as to minimize 

((fl/l — fig Xg • • • ft m JBm)) 2 and ((®j Vs *Ts * * * Vm, 

and let 

5p ==■ Xi ftgXs -* ftmXm, *p ~ Z's — v » X B V m X M . 

11 * 
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Let A be determined so as to minimize ((<? — A ip))‘. Then 

o) ~ <p — -4 ip. 

The proof depends on the still more fundamental fact, imme¬ 
diately deducible from the definition of orthogonality, that 
tf one function is orthogonal to each of two othei functions, 
it is orthogonal to every linear combination of them. 

It is at once apparent from the definitions of y and ip that 

— / ifi is a linear combination of the ce’s, of the same form 
as eo, the coefficient of a?! being unity in each case; the 
question at issue is that of the identity of the remaining 
coefficients. 

By a theorem discussed early m the chapter, a (necessary 
and) sufficient condition characterizing the coefficients in » 
is that & be orthogonal to each of the functions a%, - • •, x m - 

By the same theorem, y is orthogonal to each of the 
functions *••, x m , and ip is likewise orthogonal to each 
of these functions. Since x A is orthogonal to y and to ip, 
it is orthogonal to the combination y— Aip. The same is 
true of Xi, • • •, x m - In other words, y— A ip is orthogonal 
to each of the functions xs, • • •, a? m . 

By one more application of the theorem, y— A ip is 
orthogonal to ip. But x s can be expressed in the form 

r s = xp-^-vnxz-l -Consequently, being orthogonal 

to ip, Xi, x m , y— Aip is orthogonal also to x-i. 

The identity of y— Aip with a is thus established. 

For m = 3, the proposition is equivalent to a relation of 
perpendiculars which is important in deriving the formulas 
of spherical trigonometry: if OP, OQ, OB are three rays 
issuing from 0, if N is the foot of the perpendicular from P 
on the line OB, if a line is drawn through N in the plane QOB 
perpendicular to OB, and if M is the foot of the perpendicular 
fromPon this line, then MP is perpendicular to the plane QOB. 
This figure yields the equation sin& = sinb sin/, which was 
used in obtaining the formula for a coefficient of double 
correlation. Essentially the same configuration appeared also 
in the discussion of partial regression coefficients. 
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For the statistical application with an arbitrary value of m 
a somewhat elaborate notation is appropriate. Let the func¬ 
tion to, as originally defined by the formula xi —- 

according to the least-square criterion, be denoted by _>s 
It may be spoken of as the residual of .r, trilli respect to 
x%, cca, • ••, x m . (The order of the subscripts 2, 3. •••. m 
among themselves is clearly immaterial.) Similarly. <*- and </'. 
the residuals of Xt and of x s with respect to ./■». • - 
are to be denoted by xi . t . and xm . m . The stundat d 
deviation ox of any one of the original variables ax is defined 
by the equation er| = ((* fc )) 2 /«- (The presence of the 
denominator n is a mere matter of definition, as far as the 
present discussion is concerned, since the equations will in¬ 
volve only ratios of standard deviations; except for the 
statistical interpretation, the (positive) square root of ((a:)) 3 
itself may be used in place of aj t .) The correspondingly 
defined standard deviation of such a residual as av.» , the 
standard enror of estimate of Xi m terms of xi. x m . is 
denoted by oi 2 a - m • The partial regression coefficients 

, 1-8, • are represented by & J2 s4 m, &i3 24..»n, •••. 

fam.23 ..«n-i. The first subscript of each of the It's is that 
of the variable approximated, the second is that of the 
particular variable to which the coefficient in question is 
attached in the regression formula, and the other subscripts, 
the order of which among themselves is without significance, 
are those of the remaining variables. In the same way, 
Ps, • • •, f*m are to be replaced by &13 4 ■ m, — , bim.s «i-i* 
and v», • • •, v m by 4 m, • • - , fom a-.m-i- The coefficient 
of correlation between $p and ip, or, in the present notation, 
between xis.. m and xi 8 . m , is the coefficient of partial 
correlation between X\ and x± when x%, * • •, x m are held fast, 
and is represented by n 2 » 

In the new notation, the general proposition about the 
identity of the functions previously called a> and g>—xiifi 
asserts that is the same as the residual of ox.s m 

with respect to x%.s . m . It is important to bear in mind 
however that its essential content is independent of the 
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number of variables and their individual designations, and 
this essential content is more adequately though less concisely 
expressed by saying: 

The residual' of a given function u?th respect to a set of func¬ 
tions may be obtained by calculating successively the residuals 
of the given function and of any chosen function of the set 
until respect to the remaining functions of the set, a/nd then 
taking the residual of the formei of these residuals with respect 
to the lattes'. 

An ealier paragraph contained a derivation of the value 
of the simple regression coefficient minimizing the expression 
{{y — lx) • (y — lx)) . The formula of that paragraph which 
reads 

j _ (y-y) 1/2 -2/) 

(x • x) 1/2 ' (x • x) 1/2 (y • y) 112 

becomes in the present notation = (<r 2 /oi)r 12 . Interchange 
of the variables gives = (oi/tf 2 ) r 12 ; the simple coefficient 
of correlation is symmetrical in its two variables, so that 
r 21 = > 12 , while b ix and bn are different. Incidentally it 
appears that r 12 = (b 12 6 S i) 1/2 - 

Corresponding formulas for partial regression coefficients 
can be obtained immediately. It is apparent on inspection 
that while the coefficients of a&, x m in the function 
xi 2 s-. m — «, as expressed in the form y — A ip, are com¬ 
binations of the fi’s and the r’s, the coefficient l s — 6 x 2 8 ■■m is 
merely A. But this A is the simple coefficient of regression 
of y with respect to if /, and as such is expressible in terms 
of the standard deviations of y and ip and the correlation 
between them. The standard deviations of y and ip are 
and 028 -m. respectively, and their coefficient of cor¬ 
relation is j- 12 . 8 . m . Consequently 

t 01 8 -. m 

Oi 2 8 rn — — -fi2.8--.tfl. 

°2 8 . m 

By interchange of subscripts—in other words, by consideration 
of the coefficient of regression of x% z ,. m with respect to 
xi, s ...m —it is found that 
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/, _ . 1 - M 

^21 3 ■m - ~ Cl2 .t m 

®1 3 •ml 

the definition of the correlation, coefficient is symmetrical 
with respect to the first two subscripts. Combination of the 
equations for the V s gives 

Tl2 8 m — (612 3. vi &21 8 m) 11 ' 2 • 

In the earlier paragraph to which reference was made above 
the minimum of {{y — lx) -{y — lx)) was evaluated in the 
form (y • y) (1 — r a ). In the new notation this result is ex¬ 
pressed by the equation cr| t — of (1 — rf 2 ). The corresponding 
equation with subscripts reversed is erf 3 == <rf (1 —rf 2 ). Applied 
to the standard deviation of « = y — Atp, regarded as the 
residual of with respect to */'• it becomes 

O'2 —— /1 w2 N 

• -m 1 :* w 'IS 8- * 

Here again it is important to recognize the essential content 
of the formula, as distinguished from the notation in which 
it is expressed. Written down successively for cases of in¬ 
creasing complexity, with a particular choice as to the dis¬ 
position of subscripts each time, it yields 

ff i. 2 = *i 2 )> 

ff 1.28 ~ G 1 82 = ®1 a O *13 2)» 

(*2 —- (1 - /-2 \ 

1*284 428 1 28 V* ' 14.23' * 

. . »■•••••••• 

°1 28 • m === °1 m38 ™ °1 2:1 1 ^ } lm 23 •»*—l) ? 

and by combination of all these equations 

01 ,28 m = O'! (l (l ^18 2) (1 ^*14 2s) " • • (l — >'lwi.23- • l) • 

Reverting to an abbreviated notation for certain residuals, 
but with a modification of that previously employed, let 

0>X = #1 84 0)2 = ^2.34 -m? 

5Pl = #L4. -m? 9^2 = #2 4- -m? ?P8 — #8.4 • m- 
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By the fundamental theorem about residuals, may be re¬ 
garded as the residual of tp x with respect to g>», and is 
the residual of with respect to sp<s- Hence the coefficient 
of correlation between »i and o > a is the coefficient of partial 
correlation between spi and when <p& is held fast , and is 
expressed in terms of the simple correlations between y>i, y* 
and y 8 by the formula previously obtained in the discussion 
of correlations among three variables. If the correlation 
between <b x and a> s is denoted for the moment simply by r f , 
and if the correlations between g> L and <p s , 9>i and gp 3 , and y s 
and <ps are called respectively 112, r'u and r23, then 

, _ rj 2—?'i 3 r'23 

~ [(l-r®(l-r£)r ■ 

But from the point of view of the dependence of these 
quantities on x 1, xz, • •*, x m , 

5 ru 4 m, ris = ri 3 4 . m, r'm — r%z t , m , 

m 

?*12 4 * m ^13 4 . m ^28 4 m 
[(1 r i8 4 Ml) (1 r 2S 4 J] X/ - 

By the last equation any partial correlation coefficient can 
be calculated in terms of correlation coefficients of lower 
order, that is to say, correlation coefficients involving a smaller 
number of variables, and so ultimately in terms of ordinary 
correlation coefficients. 

Let the partial regression coefficients of x x with respect 
to 352, xt, • ••, Xm be denoted once more by h, %a, •»•, h m , 
and let 0 — ^2 X2~h hg xh-{- ... -\-h m x m . The coefficient 
of correlation between x, and <D is the coefficient of multiple 
correlation between x\ and the set of variables x%, • • *, Xm- 
It may be represented by n.ss m- If the regression coefficient 
of Xt on O is L, the square of the standard deviation of 
xi — L O is a* (1 —*1 m .. mi) . But any constant multiple of 0 
is a linear combination of xz, ♦ • •, Xm, and by the definition 


r' = ri2 84 ..»i; r'n - 

and consequently 

^12.84 m === 
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of <Z> no other linear combination of x 3 , x m can give so 
good an approximation to Xy according to the least-square 
criterion as O itself. Hence it must be that L = 1, and the 
standard deviation of x 1 —LO is the standard deviation of 
xx —©, already denoted by ay m , so that 


7 ^ 

l 28 m 


»;a»)• 


Taken in conjunction with an equation previously obtained 
for of 23 m , this shows that 


1- r. 


1 23 


= ( 1 — ( 1 ‘ 




i-i) • 


a relation from which the coefficient of multiple con-elation 
can be calculated. 


4. The geometry of frequency functions 

Apart from the generality which the preceding account of 
the application of geometry to analysis possesses by virtue 
of the fact that at the outset the independent variable may 
be one taking on a finite number, an enumerable infinity, or 
a continuous infinity of values, its substance can be given 
still another setting. In the statistical case, if there are for 
example just two functions concerned, xu= x(k) and yt— y(k\ 
the formulas involve the different values of k symmetrically, 
and are not effected if the n pairs of numbers (r*. //k) axe 
permuted, each pair by itself being kept inviolate. That is 
to say, the independent variable serves only to define the 
association of a value of x with a value of y, and has no 
further significance of its own. For dealing with large 
numbers of observations, the problem may be idealized by 
supposing that (within limits, perhaps) any value of x may 
be associated with any value of y, but that some pairs of 
values (x, y) occur oftener than others, to an extent indicated 
by a frequency function y (a?, y), whose integral over any 
region of the x, y plane measures the frequency of pairs of 
observations (sc, y) falling within that region. If x and y 
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would otherwise appear as denominator, is 1. The coefficient 
of correlation between x and y is J*W[[(£ r M (JW)] 1 ''’ 

and the other coefficients of correlation are correspondingly 
defined. 

As a first step toward the setting up of a geometrical 
representation, let auxiliary notation be introduced as follows- 

11/2 

I = * 1 , 

Jx 

^ (s/„2)x— (c/^) r, [J* z'f 

In these formulas X, Y, and Z, as well as y, are to be 
thought of as functions of x, y, and z, and the integration 
as performed with regard to these variables. As the func¬ 
tional determinant of X, Y, Z with respect to x, y, z is 1 . 
however, the integrals may equally well be taken with regard 
to X, r, Z, if the limits of integration are suitably adjusted, 
or, what comes to the same thing (since each integrand is 
identically zero except over a finite region), if the integrals 
are extended over the whole of space. As a function of 
X, r, Z, let <p{x, y, z) be denoted by sPx(X, Y, Z). Then, 
with X-, Y, Z as variables of integration, 

f XY 9 i =/ XZfi = J YZ 9l = 0. 

Let 

= XI 1,1 = YK, Ci = Z/o h , 

and let the quantity c x t x y (a;, y, z) — <f x * x a* $Pi (X, X, Z), as 
a function of the new variables, he represented by ®i(£Fi, Cib 
In terms of £i,i, x , £i as variables of integration, the relations 

f ?i ©i =-- J Stk ©i = /niti ©i=o 



Y=y-{A/<s?)X, [J 
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are satisfied, together with the equations 

f fiVi =/i =S&° 1 = 

while m (p (a?, y, z) dx dy dz = 1 goes over into 
m ®i G?i, >h, £i) dvii dtt = 1. 


If rti£i + Mi + Cx£i and a % -f- h Vi + <*s £i are an y two 
linear combinations of h, Vi, £t> 

Ci Ci)(&s§i'"f - &a^i - |- Ca Ci) ®i = flti Oa-f" &i ^a - ! - CiCs* 

Finally, for the sake of generality, let £, *1, £ be any 
variables expressible in terms of ?i, , Ci by means of a 

(normalized) orthogonal transformation. Let O x Q?i, Vi> £i) 
= CD (£,*/, £). As the determinant of the transformation is 
± 1, the equations of the preceding paragraph, including the 
last one as formulated with the various sets of coefficients 
involved in the transformation, give 


= J*£ £ (D = J%£<D = 0, 

J CD = J £*<Z> = j* 1*0 =J £ 2 <D = 1, 


the variables of integration now being '§> *1, £. The basis 
of the geometrical interpretation is the possibility of finding 
£,«?,£ as linearly independent linear combinations of,#, y, z, 
so that these relations are satisfied. 

The equations expressing £,-?/,£ in terms of x, y, z mani¬ 
festly are in fact linearly independent, and x, y, z consequently 
can be linearly expressed in terms of £, *?, £. Furthermore, 
any linear combination of x,y, z can be similarly expressed. 
Let 

U — a 1 x + fliy-+riz — i.i£+J?i«?-bCi£ 
be any such combination. Then 

dxdydz = ///C4 £+ -Bi y+ C t O a ®d£dy d£ 
= A? + J^+Ci 2 . 




V. GEOMETBY OF FUNCTION SPACE 


173 


If 


V — "sx-j-fts y-\-YiS. = A a £ 4 - B* >t 4 - Ci & 


is any second combination of the same form. 


fffvrr dx dy (l 2 — »4i jig 4“ Pi Pa 4” fi C s» 


Let P be the point until coordinates (JLi. B l} Ci), and Q the 
point (J. 2 , JB a . Cs), with reference to a system of rectangitlai 
axes in three dimensions, and let 0 be the origin. Then 

JV*y and f V s <P, the variables of integration being x . y, z. are 
the squares of the distances OP and OQ. and 


f uv *i[{J v '*){S T 'r)Y’ 


the coefficient of correlation between TJ und T", is the cosine 
of the angle POQ. Through the medium of the equations 
expressing x, y, z in terms of H,*}, £, each of the variables 
x, y, z, and every linear combination of them, can be associated 
with a definite point in three-dimensional space , in such a tray 
that standard deviation and coefficient of correlation have the 
same sort of geometrical meaning as before. Prom this begin¬ 
ning the geometrical structure of the theory of correlation 
can be built up along the lines previously followed. 


5. Vector analysis in function space 

There is further scope for the application of simple geometric 
ideas in functional analysis, where the complete picture calls 
for a geometry of infinitely many dimensions. For a single 
illustration (discussed by Ldvy, Lemons d?analyse fonctionndle, 
Paris, 1922, pp. 127-128), consider the integral 

£2 = £i(y) = e £ F(x, y, y') dx, 

where y is a (suitably) arbitrary function of x,y — /(a ?). 
and F is a given function of its arguments. Not to enter 
into details with regard to questions of continuity, let it be 
assumed that all the functions that appear in the discussion 
have as many continuous derivatives as are needed to justify 
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the operations performed. If y is looked upon as a point in 
a function space of infinitely many dimensions, or as a vector 
from the origin to the point, Slip) is a scalar point function 
in that space. Subject to the appropriate conditions of diffe- 
rentiability, let y(x) be an arbitrary function vanishing at 
n and at 6, and h an arbitrary constant. The familiar process 
of differentiation gives 

[-Jy Sl(y-\-h = X y> y'^ + >i Fy'{x, y, y)1 do- 

^ ^ a V^Fy{x,y,y , ) — ~^F V ’{x,y, y, ^dx. 

the last expression resulting from an integration by parts 
performed on the second term in the brackets, with use 
of the fact that y (a) = yQ>) = 0. Let the function 
F y —( d/dx)F y ' be denoted by 9 >(x). Then the expression 
for the derivative may be abbreviated to J* y y dx, or, in 
a notation previously employed, (y-q>). The variation hy may 
be regarded as an infinitesimal vector increment of the vector y 
in function space, of geometric magnitude h (y • y) 112 . If the 
increment of SI is divided by this quantity, instead of h. 
passage to the lijnit gives a directional derivative in the 
direction of the vector y. Its .value is iy • ?)/(v • ?) 1/a . But 
this can be written in the form ** 

from which it appears that the directional derivative in the 
direction y is equal to the quantity (go • gp ) 1/2 multiplied by the 
cosine of the angle between, the vectors y and y >, being greatest 
when y is coUinear with <p. The “functional derivative” <p. 
considered as a vector in function space, thus has the character' 
of a gradient of the scalar point function SI. 

The functional derivative can also be obtained formally 
after the analogy of the ordinary representation of a gradient 
in terms of a rectangular coordinate system. Let it be 
supposed now that y = fix) itself vanishes at the ends of 
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the interval (a, 6), and that it is expanded in a series of 
normalized orthogonal functions u k (x), each of which likewise 
vanishes at the ends of the interval, in the form 

f(x) — ai n 1 (x) + «a >h (as) -f- 


Let it be assumed further that this series admits differen¬ 
tiation term by term. The integral Si is a function of the 
infinitely many variables (a lt a s , * • •)> the coordinates of the 
point y with respect to a set of rectangular axes in space 
of an enumerable infinity of dimensions. The derivative of Si 
with regard to a k is 

~d 

F{x,y,y')dx 

a O CLlc 

— J a [F y {x, y, y') u JC (x) J r Fy' (x,y, y') n k (a*)] dx 
a [Fy{x, y, y') — (x, y, y')] u k (x)dx 

— X ^ Uk ^ ‘ 


This is the Fourier coefficient of <p(x) with respect to u k (x). 
The expression 




for a vector having the component dSi/da k in the direction 
of the corresponding coordinate axis is the formal expansion 
of the functioned, derivative <p according to the orthogonal 
system (u k ). 

An exposition of other elementary developments of the 
vector analysis of function space, rendered concrete by the 
use of theorems on the convergence and degree of convergence 
of certain expansions in series of orthogonal functions, has 
been given by the author elsewhere (Annals of Mathematics, (2), 
vol. 27 (1926), pp. 551-567; Bulletin of the American Mathe¬ 
matical Society, vol. 32 (1926), pp. 641-643). 
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